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A THEORY OF RANDOMNESS 
By M. G. KENDALL 
Introd uction 

1. In two recent papers Babington Smith & I (1038 and 1039) have discussed 
the problems of sampling with random numbers and the construction of tables 
of such numbers by mechanical methods. With the publication of 100,000 
numbers (1040) what one may call the practical side of the investigation has 
come to an end. The purpose of this paper is to develop the theory of the subject 
and to put in their proper setting some of the ideas on which the practical re¬ 
search was based. It is divided into two parts. In the first I deal with the 
symbols and mathematics of the theory of random suites, my fundamental 
contention being that a theory of randomness can be developed within the 
framework of existing mathematical notions. The second part indicates how the 
theory is to be related to practice. 

2. Much of the following work was suggested by the treatment of von Mises 
(1930) and Dorgc (1934), and I take the opportunity of expressing my indebted¬ 
ness to them. The principal difference between von Mises’s views and my own 
concerns his concept of the Irregular Kollektiv, or infinite random series. 
Numerous attempts have been made to show that this concept leads to a contra¬ 
diction and that it is therefore an improper foundation for a theory of proba¬ 
bility. Such attempts have mostly failed, but under pressure of the criticisms 
embodied in them the definition of the Irregular Kollektiv has been successively 
modified by von Mises’s followers until it has lost the pristine simplicity which 
was originally one of its most attractive features. I do not propose to discuss 
here the difficulties associated with the concept ol the Irregular Kollektiv or the 
various expedients which have been proposed to meet them. I have tried to cut 
the Gordian knot by rejecting the concept, and the theory below accordingly 
avoids all the difficulties attendant upon it. 

Part 1. The theory of random suites 

3. I consider a finite number r of symbols A v A 2 , A,., each of which will 

be called a characteristic, and an infinite ordered series of these characteristics, 
which will be called a suite. For instance, if there were two characteristics A x 
and A z such a suite might be 

A j A z A^x4. z A^A 2 A^A z .( 1 ) 

where the characteristics appear alternately. Suites exist in the sense that they 
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2 A theory of randomness 

can be completely specified by a law of formation, as the foregoing example 
shows. 

4. Definition. If the proportional frequency of each characteristic in a suite 
tends to a limit in the mathematical sense, the suite is called “proper”. In the 
contrary case, “improper”. 

Proper suites exist; e.g, the proportional frequencies of Aq’s and d a ’s in (1) 
tend each to the limit 

Improper suites also exist. For example, it may be shown that if we take the 
ith digit in the logarithms to base 10 of all the integers, beginning with 1, the 
suite of the digits 0-9 so obtained is improper, since no proportional frequency 
tends to a limit. 

Suites also exist which are proper for one characteristic and improper for 
another, provided that there are more than two characteristics. For we may 
build a suite from the logarithm table in the manner just described, and then 
insert a new characteristic Q between successive digits. The proportional 
frequency of Q will then tend to J, but those of the others will not tend to a limit. 
If, however, there are two characteristics, the proportional frequencies, being 
together equal to unity, must tend to a limit together. 

5. Definition. The limit of the proportional frequency of a characteristic in 
a proper suite is called the probability of that characteristic in that Buite, 

6. Definition. By a “Selector” I mean an infinite series of positive integers 
ordered according to their magnitude. A selector, being infinite, must be 
specified by a law of formation, not by enumeration. 

There is a special class of such laws which deserves separate consideration. 
Suppose we have such a suite as this: 

A l A i A 2 A 1 A l A 2 A i A 1 A 1 A' i A i A 1 ..., .(2) 

characteristics after the first appearing alternately in pairs. 

Our law of formation of the selector might be in these terms: proceed along 
the series until you come to the combination A 2 A 1 A 1 A 2 ; then choose the ordinal 
number of the next following member of the series, and proceed until you again 
meet that combination; and so on. The series of ordinals so obtained is the 
selector. 

The importance of selectors of this type is that they are mathematically 
independent of the particular characteristic of the member whose 'Ordinal 
number is chosen. By mathematically independent I mean that the value of this 
member does not appear in the law of formation, so that the same member would 
be chosen whatever its characteristic. 

Definition. If a seleotor is constructed from a suite and, in virtue of the law 
of formation, any member of the selector is mathematically independent of the 
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characteristic whose ordinal number in the suite is the value of that member, 
the selector is said to be “disjoint” with respect to the suite. 

7. It might happen that a law of formation of a disjoint selector was given 
which did not in fact lead to a selector in the case of certain suites. For example, 
with the suite (2), if we try to construct a selector by choosing ordinals corre¬ 
sponding to the characteristics following three successive A^a, no ordinals 
appear. Such a law I should regard as degenerate in relation to that suite, and 
I exclude it from the domain of discussion from this point onwards. Hereafter, 
in speaking of a selector in relation to a given suite I shall assume that the one 
is disjoint with respect to the other. 

8. We may now apply selectors to pick out subsets from a suite. We do so by 
choosing from the suite those members whose ordinals are the numbers appearing 
in the selector. 

Ex hypothesi, the result of this process will be a new suite of the charac¬ 
teristics (some at least) of the original suite. We may call this a “ Derived suite 
Symbolically, denoting the selector by the roman S and the suite by K, we may 
write D = SK. .(3) 

I proceed to prove one or two theorems of a negative kind about derived 
suites. 

9. A suite derived from a proper suite is not necessarily proper. 

For let K = A 1 A 2 A 1 A 2 A 1 A 2 A 1 A 2 ..., 

S = 1, 2, 4, 5, 7, 9, 11, 13, 15, 16, 18, 20, 22, ..., 
the numbers running alternately in sets of even and odd, the number of each 
kind being equal to twice the number of preceding members of the selector. 
Then Sif _ A 1 A 2 A 2 A 1 A 1 A 1 A 1 A 1 A 1 A 2 A 2 A 2 A 2 A 2 A 2 .... 

However far we go in this series, say to the end of a run of d 1 ’s, there will follow 
twice as many A 2 ’ s as there have already occurred of both dj’a and A 2 ’s. Clearly 
the suite is improper. 

10. If we apply another selector S 2 to a derived suite we get a further 
derived suite which we may write S 2 S X K. It is clear that this will not in general 
be the same as S 1 S 2 jK'. 

The “identical” selector E = 1, 2, 3, 4, ... is of some importance. Clearly it 
reproduces a suite to which it is applied, and ES X A = SjEif, etc. 

Randomness 

11. Definition. If the probability of the characteristic A 1 in a proper suite 
is p ; and if the probability of A t in a proper suite derived from it by the selector 
S is also p; then the suite is said to be random for the characteristic A x with 
respect to S. 

Suites and selectors with this property exist. Every proper suite is random 
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with, respect to the identical selector E, and the suite (1) is random for both -lj 
and A 2 with respect to the selector 

1,4, 7, 10,..., 

though not to the selector 1, 3, 5, 7, .... 

12 . Definition. A suite which is random for a characteristic A with respect 
to a number of selectors S ls S 2 , ..., S TO is said to be random in the selector domain 

It is to be noted that if a suite is random with respect to and b 2 it does not 
follow that S ± K is random with respect to S 2 or S t K with respect to S x . E.g. if 

K ~ A 1 A 1 A 2 A i A 1 A 1 A 2 A s ,..., 

S^ l, 3, 5,7, 9,..., 

S a = the disjoint selector obtained by writing down the ordinals of charac¬ 
teristics next following A lt 
then S 1 K = A 1 A 2 A l A 2 A 1 A 2 ..., 

S 2 -E = A y A 2 A y A 2 A y A% ..., 

so that K is random for A y and A 2 with respect to both and S a . But 

S 2 Sjif — A. 2 A 2 A 2 A^A 2 A 2 ..., 

SjS a /f — AiAiA y A y A y A^ .... 

13. Given a suite and a certain finite set of selectors, we may consider the 
suites obtained by repeated applications of groups of these selectors. This will 
give us a series of derived suites which may be infinite but is nevertheless 
ordered. If all the resulting suites are proper and the probability of a charac¬ 
teristic A in them all is the same as that in the parent suite, the latter is said to 
be completely random in the selector domain S x , S 2 , ..., S. m . 

There exist suites which are completely random in certain domains. E.g. if 

K. — A y A 2 A y A 2 A y A 2 
Si =1,4, 7, 10,..., 

then Sj.iT = A^A^A^ ... = if, 

so that repeated applications of S x lead only hack to the original suite. 

Consider further S 2 = 2 , 5, 8 , 11 , .... 

Then S 2 iC = A S A 1 A 2 A 1 A 2 A 1 ... 

and S|iT = A 1 A 2 A 1 A Z A 1 A 2 .... 

Thus, any number of applications of S x and S 2 lead either to A 1 A 2 A 1 A 2 A 1 A 2 ... 
or to A< i A 1 A i A l A 2 A 1 ..and hence the suite is completely random for A y and 
A 2 with respect to S L and S 2 . 

It follows at once that any suite which is derived from a completely random 
suite by a selector of the set is also completely random within the same domain. 
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14. The foregoing examples, though trivial enough, show that the various 
ideas which have been introduced are not self-contradictory, and that they fall 
within the scope of ordinary mathematical concepts. But the use of the word 
“random” to describe a state of aff airs which is the reverse of what is ordinarily 
understood by the word requires some scholium. I have, for instance, remarked 
that any suite is random with respect to the identical selector 1, 2, 3, 4, .... But 
surely, it may be said, such a suite as A 1 A 1 A 1 A 1 A 1 A 1 A 1 A 1 A x ... is the very 
reverse of random, being as systematic as any such series can be % I will antici¬ 
pate a later part of this paper to some extent by a short explanation of this 
point. 

15. In statistical work we require one thing above all in a “random” 
selection; namely, that if continued long enough it shall draw all members of the 
universe equally often, or at least in a known proportion. In fact, it is not the 
haphazard quality of randomness that we use in drawing inferences from random 
samples, but the only thing about it which is not haphazard, namely its property 
of producing definite limits. (I am, of course, speaking colloquially.) Any 
method of selection would serve equally well if it satisfied this primary requisite. 
The “random” series with which we are familiar in ordinary work are un- 
purposive and chaotic in appearance for two reasons: firstly, because we fondly 
hope to have a series which gives a suite random in regard to all possible 
selectors, so that it can be used to draw random samples from all universes 
whatever the characteristic under consideration; such a series must be random 
in a very wide selector domain, including all the more obvious selectors which 
would give the series a purposive appearance, and consequently it looks un¬ 
systematic ; secondly, as an experimental fact we have learnt that when a sample 
is chosen haphazardly it is often random, at least so nearly so that ordinary in¬ 
spection of a series of results will not reveal the difference; this haphazard 
selection leads us to expect from it an unpurposive-looking result. 

16. But there is no virtue in lack of purpose for its own sake. In fact, 
random sampling has a very definite purpose, and it is the purposive parts of it 
that we have in mind in using the method at all. As the domain of selectors 
becomes larger, the random suite becomes more and more like the completely 
haphazard entity which von Mises would like to make the basis of his theory; 
but in my view the random suite must always be considered as random in a 
finite domain. I contend that there is no such thing as absolute randomness, 
just as there is no such thing as absolute velocity. The latter has meaning only 
with reference to a co-ordinate framework, the former only with reference to a 
selector framework. I might summarize the attitude of the foregoing paragraphs 
by saying that they are founded on the concept of the relativity of randomness. 
If this be agreed, the difficulties about terming “random” certain series which 
do not conform to the colloquial use of the wmrd at once disappear. 



6 


A theory of randomness 
Multi-dimensional suites 

17, As a simple extension of the idea of a suite of characteristics we may con¬ 
sider suites of sets of characteristics. Such an extension offers no difficulty, and 
is very similar to the transition from describing points on a line in terms of'one 
•co-ordinate to points in a multi-dimensional space by several co-ordinateB. 

We may amalgamate two or more suites into a suite of more dimensions. 
E.g. with the suites 

B^B.B^B^B^..., 

we can associate the nth member of one with the nth member of the other to 
obtain (Ax Bj (A a iy A3») (A Z B X ) (A x B t ) (A % B Z ) .... 

Convolution 

18; We may also construct an wi-dimensional suite by dividing a one¬ 
dimensional suite into blocks of m. This process is worth noticing. Consider the 
suite A^A^AjA^A^.... 

This is proper and each characteristic is random with respect to the selector 

1, 3, 5, 7, 9, .... 

Now-suppose we make a two-dimensional suite by bracketing successive terms, 

thus: (AA) { A 2 a i} (A^MAA).... 

This is proper with respect to the two two-dimensional characteristics A^a) 
and A A x ) but it is not random with respect to the selector given. 

Definition. I shall refer to the process of deriving a multi-dimensional suite 
from a one-dimensional suite by grouping sets of successive terms as “convolu¬ 
tion”, and the derived suite will be said to be “ convoluted”. Erom the example 
given it is clear that a convoluted suite is not necessarily random in the domain 
of randomness of the parent suite. 

Independence 

19. Definition. If a two-dimensional suite is derived from two one-dimen¬ 
sional suites by attaching one member of the first to the member with the same 
ordinal number in the second; if the original suites and the new suite are proper; 
and if the probability in the derived suite of a characteristic {A^B k ) is the pro¬ 
duct of the probabilities of A i in the first suite and of B k in the second for all j 
and k] then the two original suites are said to be statistically independent. 

Definition, If from a proper two-dimensional suite there are derived two 
proper one-dimensional suites by ignoring tbe first and then the second charac¬ 
teristic of the pairs which constitute the suite; and if these two suites are 
statistically independent, the two sets of characteristics are said to be statistically 
independent in the original suite. 
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Statistical independence as thus defined concerns either suites or charac¬ 
teristics in suites. Like probability it is a property of aggregates, not of indi¬ 
viduals. 

20. The generalizations of statistical independence to the case of several 
suites or multi-dimensional suites can be made without difficulty. I shall here 
omit them and the theorems which they obey, since all the results are obvious 
extensions of the theory of class frequencies set out for the case of finite classes 
in the Introduction by Udny Yule and myself (1939). The following results are, 
however, worth recalling; 

(a) If K, L, M are proper suites, K is statistically independent of L and 
L is statistically independent of M ; it does not follow that K is statistically 
independent of M. 

( b ) Three suites are statistically independent only if the probabilities 
(. A j B k G l ) are equal to the product of the probabilities A } in K, B k in L, and 
in M. It is not sufficient that they should be statistically independent pair and 
pair, as the following example shows: 

K = A^A^A^A^A^A^A^A^.. 

L = B l B 2 B 2 B l B l B 2 B2B 1 ..., 
2f-C r 1 0 1 0 1 0,0 l (7 1 C t f7 t .... 

Here, for instance, the probability of (^I^C^) is zero in the suite obtained by 
associating triads from members of the suites which have the same ordinal. 

Local randomness 

21. A suite as defined above is infinite. I now consider a finite series of 
characteristics which I call a sequence. A sequence may be considered as a 
section of a suite. 

It is evident that any sequence, being finite, can form part of a suite in 
which the probabilities have any given value and which is random in any as¬ 
signed domain. We may, however, imagine the selectors of the domain applied 
to the sequence, that part of the selectors which contains numbers greater than 
the number of members in the sequence being ignored. Similarly, we can 
convolute the sequence in any way consistent with its size and apply selectors 
to the sequences so derived. We can compare the actual proportions of charac¬ 
teristics in these sequences with those in any given suite. Any such process 
I call a test. 

Definition . If the proportional frequencies in a sequence are approximately 
what they would be in a suite random with respect to the selectors of the test, 
the sequence is said to be locally random with respect to that test; and so for a 
test domain. 

To make this definition precise it is necessary to consider what is meant by 
‘•approximately”. Suppose the sequence is of size n, and consider ther B possible 
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sequences of this size. To each there will correspond a proportional frequency 
under the tests. Choose a number of these, ctr n , which may be regarded as 
“approximately” the same as the proportional frequency in the suite. Then if 
the given sequence is one of these, it is “approximately” the same. Clearly the 
word approximately depends on the choice of the number a which corresponds 
to what is generally known as a “level of significance”. 

The concept of local randomness, in my view, is important. The series of 
characteristics which we encounter in real life are always sequences, not suites; 
and we have to estimate probabilities and random properties from finite aggre¬ 
gates, not from infinite series such as form the basis of the theory. Any finite 
series of characteristics whatever is random in the sense that it might arise, how¬ 
ever infrequently, in random sampling. But in order to make any practical use 
of our theory we have to consider certain series as non-random, or in other words 
we have to judge from the local randomness of observed sequences. 

Part 2. Application op the theory to practice 
Events 

22. Events are the primary data of statistical experience. Every event has 
a number of properties, the conceptual abstractions of the Gestalts which it 
provides. These properties may be called characteristics, and it is with aggre¬ 
gates of characteristics that statistical inference is concerned. The throwing of 
a die and the growing of a crop on a given field are events. Characteristics of the 
former would include the number which came uppermost, the time at which the 
throw was made, the angles which the edges made with a line fixed in space, and 
so on. In general an event has an infinite number of characteristics. When we 
have a complex phenomenon such as a crop of wheat on a field it is a matter of 
choice whether we regard the whole thing as one event, or look on it as a series 
of associated events, e.g. a collection of crops on a number of square yards. But 
the event is to be regarded as including the whole of the happening, and is not 
synonymous with one of its characteristics. A yield of wheat is not an eyent, 
nor is the number thrown by a die. 

23. Consider then an aggregate of events. Suppose there exists a finite set 
of characteristics such that each event has one and only one characteristic of the 
set; for example, the events consisting of throws of an ordinary die must have 
one of the characteristics 1-6, according to the number which falls uppermost, 
and cannot have more than one. We can then say that the aggregate of events 
gives rise to an aggregate of characteristics. 

24. Now the aggregates of events we meet in experience are always finite. 
It is true that we sometimes regard a line as composed of an infinite number of 
points or a solid body as an infinite number of particles; but these are mental 
fictions and it is not possible to observe the characteristics of an infinite number 
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of events. The finite aggregates of our experience can be ordered to produce a 
sequence—in fact they are usually arranged for us by the temporal order in 
which they occur. The fundamental problem of linking theory and practice— 
and an analogous problem arises in all frequency theories of probability—is to 
relate the sequences of observation with the suites of theory. 

25. The sequences of observation may be regarded as generated by a 

physical process. The sequence consisting of throws of a die, for example, may 
be considered as defined by the rules under which the die is cast. A sequence of 
crop yields is determined by the circumstances under which the crop was grown. 
I shall assume that the physical process generating a sequence can be depicted 
by a mathematical law defining a suite, in the same way that the “straight 
lines” we draw on paper can be depicted by the straight lines of Euclidean 
geometry, or a rigid body by the abstractions of mathematical dynamics. 
Members of a sequence are ascertained by experiment; those of a suite by 
calculation. ' 

26. I also take it as empirically established that there are observational 
sequences which can be adequately described as locally random sections of 
suites; random, that is, in certain domains. And I assume that the processes 
generating these sequences will, if continued, produce further sequences which 
are also locally random. This is essential to all scientific inquiry, that a law 
which is established will continue to operate. If it does not, we must alter the 
law; but before carrying out the extra trials we can only act on the assumption 
that the law will hold. Put in this way, perhaps, the assumption seems un¬ 
justified, but it is made every moment of our lives. In writing these words I am 
assuming that a past phenomenon will recur, namely that a particular arrange¬ 
ment of marks on a piece of paper will evoke certain ideas in the reader’s mind. 
This much I am compelled to concede to those writers, like Mr Keynes and Dr 
Jeffreys, who contend that probability cannot be defined in terms of frequency, 
namely that the uncertain attitude of mind which one adopts towards some 
laws cannot be measured by probability as I have defined it. As to the phe¬ 
nomenon of mental doubt, the scientific procedure by which it is removed or 
strengthened, the desirability of measuring it, I am in agreement with Dr 
Jeffreys; but I do not call the measure probability in a statistical context. This 
is only a matter of words, but unfortunately so is a great deal of statistical 
discussion. 

27. The probability of a characteristic in a physical process is to be estimated 
from the observed sequence to which the process gives rise. I need not dwell 
here on the methods of estimation and the ideas which underlie them. But there 
is one point of some importance to note. It is evident that probability is a 
property of characteristics, not of events, and to be strictly accurate we should 
always speak of it as such. Eor instance, if I toss a penny on to a chessboard, the 



10 A theory of randomness 

limit of the proportion of heads may be |, but the limit of the proportion, of cases 
in which it falls on a white square may be f. Neither of these fractions is the 
probability of the event. They are probabilities of characteristics and there is 
nothing inconsistent in the fact that they are different. 

Independence 

28. The statistical independence of two suites, or of characteristics in a 
multi-dimensional suite, was defined in paragraph 19, and the statistical inde¬ 
pendence of observed sequences follows the same line. In statistics the question 
whether two series of characteristics are independent is to be determined purely 
from the experimental data. There may be very good reasons why one charac¬ 
teristic is “dependent” on another in a causal sense, but if the occurrence of one 
is not accompanied by the occurrence of the other in “unexpected” proportion 
they are statistically independent. Contrariwise, there may be no obvious causal 
nexus and yet the two may be statistically dependent. In fact, I would be in¬ 
clined to deny any separate meaning to “ causal” dependence other than that of 
statistical dependence (with perhaps, allowance for the temporal element). 

29. This point is important in one respect. I have up to the present spoken 
only of the independence of characteristics, not of events, and even of the former 
only in terms of suites or sequences. But in the theory of probability as ex¬ 
pounded in textbooks it is quite common to meet with such expressions as “a 
series of independent events”, or “successive events are independent”. The 
word “event” here means what I call a characteristic; but can we speak of “a 
suite of independent characteristics” 1 I do not think so. In my opinion the 
concept is equivalent to that of the Irregular Kollektiv of von Mises. 

30. For example, one would be inclined to begin an approach to a definition 

of the concept by requiring that each characteristic was followed equally fre¬ 
quently by all characteristics of the suite, e.g. that in a suite of characteristics 
A 1 A 2 , an A 1 was followed equally frequently by an A 1 or an A z . But this is true 
of the suite AAAAAAAA 

which is clearly not of. the. type desired. One might then require that each 
characteristic should, in addition, be followed next but one by all other charac¬ 
teristics in equal amount. But this is true of the suite consisting of repetitions of 

A 1 A Z A Z A Z A-^A X A 1 A Z ..,. 

Baffled by continual examples of this kind, one might then require that the 
occurrence of any characteristic was to be independent of all or any of the 
characteristics which have preceded it. This, on analysis, is found to be equi¬ 
valent to the requirement that the suite shall be random with respect to all dis¬ 
joint selectors of the type considered in paragraph 6; and this is precisely the 
difficulty of the Irregular Kollektiv. 
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31. I can see no way round this difficulty; and I therefore reject the suite of 
independent events as I reject the Irregular fcollektiv. This has two important 
consequences, the first concerning the ordinary theorems of probability and the 
second concerning Bernoulli’s theorem. 

The first point may best be illustrated by an example: suppose the proba¬ 
bility of getting a head with a toss of a penny is f. What is the probability of 
getting two heads with consecutive tosses? Anyone grounded in the classical 
theory would answer “ without hesitation. Nevertheless, the result is only 
true under certain conditions. In fact the data of the problem are that there is 
a suite of throws of the penny and that the proportional frequency of heads in 
this suite is Now such a suite might be (A x = heads, A 2 = tails) 

A.^A-2 A.-y A^A^A^ ..., 

and the probability of getting two successive heads is zero. But, it may be ob¬ 
jected, this is an artificial series which would never occur. To this I should reply, 
Agreed, but why should there not occur a natural series in which the proportional 
frequency of pairs of heads did not tend to J ? It will, I hope, be clear on re¬ 
flection that there is nothing in the data of the problem to require the answer J 
as' a logical necessity unless we make some additional assumption such as this: 
the occurrence of one characteristic is statistically independent of the occurrence 
of the next. This contains the answer of £ implicitly. 

32. In generalization of this problem we might ask: if the probability of the 
characteristic is p, what is the probability that in a set of n characteristics we 
shall get r successes and n — r failures? Here, again, the answer of the classical 

theory would be p r (1 —p) n ~ r \ and here again the result is only true if we 

assume the statistical independence of sets of n. Clearly if the result is to be 
true for all n wp are once more verging on the suite of independent charac¬ 
teristics referred to in paragraph 30. I conclude that for statistical purposes the 
results of the classical theory of probability are not to be accepted without 
examination. If in any particular case we require one of these results, we must 
be satisfied that the suite we are considering is such as to justify the use of it. 

Bernoulli’s theorem 

33. The well-known, theorem given by James Bernoulli in the Ars Coniectandi 
is subject to similar limitations in regard to its statistical applications. In 
essence the theorem is a proposition in algebra which may be stated thus: in the 
binomial distribution (p + q) n , if u be the sum of the greatest term and the n 
preceding and n succeeding terms, the ratio of u to the sum of the remaining 
terms may be made as large as we please by increasing n sufficiently. There can 
be no criticism of this result. But, as applied to statistical series, the theorem 
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states that if the probability of a characteristic is p and we observe m sets of n 
events, the proportion of sets in which the proportion of successes differs from 
.p by less than e tends to unity with large m and large n. Or put another way, 
the probability that the proportion of successes in a set of m differs from p by 
less than e tends to unity as m tends to infinity. Symbolically. 

P {| p-h(A ) | <*?}>! — i), m > M, 
where h(A) is the proportion of successes. 

34 . This is a statement about the probability of a probability and L need not 
emphasize the logical weakness inherent in it. On looking into the proposition 
further we find that it is dependent on the type of assumption considered in 
paragraph 32, namely, that if the probability of a characteristic is p, the proba¬ 
bility of r characteristics in a set of n is p r (I —p) ur . 

This will only be true if the observed series is, approximately at least, a series 
of “independent” characteristics. Consider, for example, the suite 

AyAyAyiA^AyAyA^AQ***' 

the characteristics of which are random with respect to the selector 

1,3, 5,7,9,... 

and have probabilities each equal to Consider the convoluted suite 

(AA) (AA,) {AyAj cvy. 

Bernoulli’s theorem, as usually stated, would lead us to the conclusion that the 
probability of getting in this suite a pair containing one Ay and one A t is L 
Actually it is zero. 

If, once again, it is objected that this is a highly artificial series, 1 reply as 
before that series with the same properties might arise naturally. We can only 
make Bernoulli’s theorem legitimate by postulating that the suite to which it 
is applied shall have the property of randomness under convolution for all n. 
I do not think tills is always a legitimate hypothesis to make, but I am anxious 
not to be misunderstood on the point. There undoubtedly exist sequences which 
can be regarded as belonging to suites random in a very wide domain- so wide 
that for many practical purposes they can be taken to be sequences of "inde¬ 
pendent” events. The point to be stressed is that this assumption underlies a 
great deal of statistical work but is never brought to light, and indeed is often 
not realized. The statistician takes it for granted; but to the philosopher nothing 
is more surprising than the orderly disorder which is common in Nature. 

Random sampling 

35. A statistical universe is an aggregate of objects, which may he finite or 
infinite. I consider selective processes which consist of abstracting one member 
at a time from this universe, and I suppose that each member is returned to the 
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universe after drawing if the universe is finite. The abstraction of each member 
may then be considered as an event, whose characteristics may be noted. 
I assume that there exists for these events a set of characteristics such that each 
event must bear one characteristic and can bear only one. 

We may then imagine this selective process, which I shall call sampling, as a 
generator of a sequence of any desired extent, and to be capable of continuation 
without limit. The result of unterminating sampling would be to give a suite of 
events, to which there correspond one or more suites of characteristics. In 
practice we shall have a sequence only. 

36. Definition. If a sequence obtained from a universe U by sampling is 
locally random for a characteristic A within a selector domain D, then the 
sampling is said to be random for U with respect to A within the domain D; 
and any member of the sequence is called a random sample for the characteristic 
A in the domain D. 

This definition brings out the extremely relative nature of random sampling. 
A method which is random for one universe may not be random for another; a 
method random for one characteristic may not be random for another,, even in 
the same universe; and the randomness is always relative to the selector domain. 
It is also to be noticed that the sampling process, being physical, can only be 
related to sequences, not to suites. 

The assumption we make in using a random sampling method is that if it has 
in the past generated locally random sequences it will continue to do so in 
similar circumstances. The justification for this assumption is empirical. 

37. In practice we sometimes draw samples one at a time and so obtain a 
one-dimensional sequence. We then convolute this sequence into groups of n, 
making an ^-dimensional sequence. But we may also draw the samples in a 
block of n (which I shall call a “ clutch”). The difference is of some importance. 
If we ignore the order of the individuals in a convoluted sequence we have what 
is virtually a clutch, and it'is very common in statistical work to ignore the order 
in this way. A series of sampling results, for instance, are frequently given with¬ 
out any indication of the order in which the individual results appeared. It 
should not be overlooked that certain information relevant to the randomness 
of the sample has disappeared in the process. For example, we may be told that 
in a sample of 1000 births 510 were male. We should probably conclude that there 
was nothing in the sample to show that it was not random. But if we know that 
the first 510 were-male we should certainly conclude that it was not. 

38. What are the grounds on which a selective process of experience is con¬ 
sidered to give random samples? In the first place, as Ms already been re¬ 
marked, we can only use a selective process t'o produce /finite sequence. This 
sequence is always locally random in some domain or qfher. If we find that, as 
the sequence is increased, local randomness is maintained, we may say that the 
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method is random for the universe and characteristic considered. But we require 
more for a method to be used in practice. We require to be able to suppose with 
some confidence a priori that it will be random for fresh inquiries, fresh universes, 
fresh characteristics and fresh sequences. And we require the domain of random¬ 
ness to be as wide as possible. It was formerly the custom to assume randomness 
to any desired extent if there was no obvious reason to the contrary—a sort of 
Principle of Non-sufficient Reason. This is most unsatisfactory, and could only 
be justified if it was found in practice that haphazard methods of selection give 
locally random sequences. In fact we find that whenever any element of 
personal choice is allowed free play, bias is very liable to appear. 

39, It seems to me that we can never rid ourselves entirely of the possibility 
that a method of selection may lack randomness; but we can safeguard against 
the possibility to a great extent. For instance, the method of Random Sampling 
Numbers applied to a universe of names in a directory gives us something near 
certainty (if I may be allowed that colloquial expression) that the resulting 
samples will he random. Furthermore, we can experiment with a method to see 
if bias has appeared. If it has not, we are justified in expecting that it is random 
for the class of cases in which it has been tried. Ultimately, however, the 
assumption of randomness is part of the hypothesis which is being tested. 

40. An assumption which is usually made in practice is that the method is 
random within whatever domain happens to suit the investigator at the 
moment. One draws a random sample from the universe of inhabitants of the 
British Isles. One says that the sample is “random” without any qualification. 
Behind this lies the assumption of the Irregular Kollektiv which has been con¬ 
sidered from a different angle in paragraph 30. A great many statisticians would 
use such a sample to test any hypothesis about the universe which they chanced 
to encounter; they would assume that it was random in regard to height, sex, 
age or any other characteristic; they would assume that it was random under 
convolution; and they would assume the legitimacy of testing in any sampling 
distribution which happened to be convenient. All of this amounts to an 
assumption of randomness in a very wide domain, depending on a subjective 
judgment which may be quite wrong. The wider the domain, the less likely 
(again speaking colloquially) is the assumption to be justified. In practice this 
assumption frequently has to be made, and can be made without much danger 
with a good sampling method. But the greatest danger lies in the fact that the 
person making the assumption very often does not even realize that he is doing 
so. In any sampling inquiry it is necessary to ask oneself. Is the sampling method 
I am using random fqr the universe I am considering, for the characteristics I am 
discussing, and for the v sampling distributions or tests of significance I am em¬ 
ploying! Randomness irrelative. 
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A GEOMETRICAL ANALYSIS OF THE FREQUENCY 
DISTRIBUTION OF THE RATIO BETWEEN TWO 

VARIABLES 


By C. NICHOLSON, M.C., M.A., M.D. 

This subject lias already been treated by Geary (1930), and by Keller (1932), the 
approach to the problem in both cases being algebraical; the geometrical approach 
to the same problem suggested itself to me when I was working on a series of 
anatomical measurements (Nicholson, 1938). This paper could hardly have 
reached publication without the generous assistance of Mr N. L. Johnson of 
University College, London. 


(1) Variables independent 
We are to consider the distribution of the ratio 


y + Y 
x+X’ 


where Y and X are constants, and the joint distribution of x and tj is given by the 
normal bivariate surface 

1 f lfa? y 2 Y 

Then if we refer our observed values (y+ 7) and (ar+Y) to the axes y = 0 and 
x = 0, jfche co-ordinates of the intersection of the zero values of the variables will 
be —X and - Y, since x-t-X = 0 when x = ~X. The ordinates for a constant 
value of the ratio will lie in a plane surface passing through this point of the 
general form 

y~mx + c, 


where m will have the value of the ratio, and c will be ml - f. The equationto 
the projection of the section of the normal surface by this plane on the (%, z) 
plane will be 




exp 


T lx 2 


2 1 ( 7 : 


; +■ 


{mx + cf \ 


( 1 ) 


which can he rearranged to 


2 ™«<r v 


~ exp 



— 

mw 2 ' 

2“ 

r m c Fi 

1 

X+ V(wV^ + cr«) 


L 2\V(mV> + <rg)J J eXp 
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^/{m 2 cr 2 + (r 2 ) 

- 


(la) 
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The area of this section is the integration of z from — oo to + oo, the variable 
(because of the change of angle) being x^l(l +m?), and this is 
^/(l +m 2 ) 


V(2v)V(mV 2 +o- 2 ) 


eXp _ 2 {v(m 2 <r* + cr*)} 


The quantity 


- in (2) is equal to 


mX-Y 


-tj— v -, - 7 .^, which is the function of 

V(mV! + <r 2 ) £r I + °'§) 

the ratio which Geary treats as t. Now m is the tangent of an angle, say /?, and (2) 
may be put in the form 


1 f 1 1 ceos/? 

*J(27r) sin 2 /? + cr 2 cos 2 /?) eXp |_ 2 (sin 2 /? + cr 2 cos 2 /?) 

Here c cos /? is the perpendicular distance from the origin of the surface to the 
plane y = mx+ c, so that we can draw the conclusion that a series of parallel plane 
sections of a normal surface making an angle of /? with the x axis are normal 
curves with a standard deviation of 



_ u x u v _ 

^/(v 2 sin 2 /?+cr 2 cos 2 /?)’ 

while their areas form another normal curve with the variable c cos /? and with a 
standard deviation of ^2 s j n 2 yj + o-a cos 2 ^ 


(2) Variables correlated 

Clearly, in the case where the primary variables are not independent, it is still 
possible to reduce the distribution to this same geometrical system, so that we 
may discard reference to the primary variables and refer rather to the principal 
axes of the surface generalizing (3) as 
_ 1_ 

(2tt) ^(a 2 sin 2 (a + d) + IA cos 2 (a + d)) 

r Jf fcsin# ) 2_ l 

X6XP L 2 \ V( a2 sin 2 (a + #) + 6 2 cos 2 (a + £?) jj J ’ ( ) 
where, referring to Fig, 1, a and b are the standard deviations in the direction of 
the major and minor axes respectively of a normal surface; also k is the distance 
of a point K from the origin, K being the focus of a pencil of planes cutting the 
surface, and k being equal to *](X 2 + F 2 ). The angle KOA is a, which is the absolute 
value of the angle less than \n between the major axis of the surface and the line 
joining the origin to the intersection of the zero value of the variables; it is 

| tan _1 (F/X) —(?|, 

where 8 is the angle between the x axis and the major axis which is given by the 
equation 

vl-°l 

6 is the angular deviation of any plane from the angle a. which is taken as the 
origin of the pencil, and the value of any ratio will be given by tan (a + 8+6). 

Biometrika xxxu , 
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Frequency distribution of a ratio 

It is clear that if we were to confine ourselves to the distribution of the ratio 
we must use tan (a+ 6+0) as the variable, but in this generalized form it is more 
informative to use the angle itself as the variable. The cumulative frequency for a 
deviate 0 is the content of the two plane angles PKT and P'KT' , and where K lies 
without the bulk of the distribution the content of the angle PKT will be negli¬ 
gible. The content of the angle P'KT' may then be taken as the content between 



Pig. 1. Projection, of normal bivariate surface to illustrate the geometry of 
the ratio between variables. 


the two parallel planes, TKT’ and SOS', the latter passing through the origin of 
the surface, and this is 

_ 1 _ 

^(2?r) sin 2 {<x+0)+ 6 2 cos 2 (a+ 6)) 

/’festn# r 1 ( u 1 2- 1 

Jo CXP L 2 (y^a 2 sin 2 (a-f 0) +6* cos 8 (a+ i9))j _ ^ U ' ^ 

If in (5) we put —ryra- n -zr, as t, we get 

•s/(a 2 sin 2 (a + 6) + b 2 cos 2 (a + 6)) 6 

_ feeing _ 

l j'V(o'8in , (c!+0) + & , cos , (a+0)) 

^J„ < 5 «> 

where, if the variables are independent 

__ bsinO ___ c cos (a + 6>) _ mX-Y 

f(a 2 sin 2 {a + 6) + b 2 cos 2 (a+d)) \j(a 2 sin 2 (a + 9) + 6 2 cos 2 (a + 0)) ~ + erf) ’ 

so that (5) is identical with Geary’s formula. 
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Continuing to regard d as the variable, the equation to the curve for the 
frequency distribution given by (5) is the moment of the area (4) about K, and this 
can be arrived at either geometrically or by differentiating (5) with regard to d\ 
it is 

_ 1 f fc(a 2 sina sin (a. + 6) 4- b 2 cos a cos (a. + d))| 

^ *J(2n) \ (a 2 sin 2 (a+ #) + 6 2 cos 2 (« + $))* ) 


II ksinff 

x ex P _ 2 j \/( a2 si 1 * 2 (a + 6) 4- 6 2 cos 2 (a + &)) 


( 6 ) 


Here the numerator of the factor within brackets may be put in the form 

k ^(n 4 sin 2 a + 6 4 cos 2 a) sin (a + y + d), (7) 

where y is the absolute value of the angle which the axis conjugate to POP' makes 
with the major axis of the ellipse, i.e. where tany = (6 2 /a 2 ) cot a. It is thus seen 
that the curve is limited, the range of 6 being from — (a+y) to n — (a + y); at these 
angles the value of y is zero. 

It should be noted that the content between the planes SOS' and TKT' is 
equal to the content of the angle P'KT' minus the content of the angle PKT, so 
that in the integration the value of the content of the angle PK T is twice neglected. 
It follows that if we integrate (6) between the limiting values of 6 the total amount 
neglected is twice that part of the surface which lies beyond a plane GKG' which 
passes through and makes an angle of y with the major axis, the value is 


2 r°° 

~j77r-\ e-^dt, 

V(^) J kl<r* 

where cr a is the standard deviation of the normal curve given by a plane section 
of the normal surface which makes an angle of a with the major axis, i.e. where 


_ __ .ab 

* yjia 2 sin 2 ot + b 2 cos 2 a)' 


(3) Curves given by this equation 

The curves generated by this equation are of very great variety, the majority 
being bell-shaped, and we may now discuss the effects produced by varying the 
constants. It should be noted that, as POP' bisects the normal surface, the origin 
of the curve is neither the mean nor the mode but the median. 

‘k” may have any value from 3<r a or 4cr a up to infinity; as k tends to infinity 
the distribution of the standardized variable 6 tends to normality with a standard 
deviation of 

V(« 2 sin 2 oi + b 2 cos 2 a). 

As k decreases in value the departure from the form of the normal curve becomes 
more marked. 
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Frequency distribution of a ratio 

“alb” may have any value from unity to infinity. When the value is unity the 
curve is very similar in form to the normal curve; as the value of (ijb increases 
departure from the form of the normal curve increases. 

On the value of a the symmetry of the curve depends; a may have any value 
between 0 and r. At zero the curve is symmetrical but steeper than the normal 
curve. As a increases asymmetry develops (asymmetry being taken to mean the 
excentricity of the median), the maximum excentricity being reached at a value 
of tan- 1 ( b[a ), thereafter the curve returns gradually to symmetry at a value of 



. Fig. 2. Cum from equation (0). ConstantB: a=4, 6 = 1, k=:Sa n , a =80°. 

£* t, where it is flatter than the normal curve. Skewness develops with asymmetry 
but more slowly, and the maximum skewness is not reached until a has a value of 

K 

If k is relatively small, i.e. is near 3cr a> and a/6 is large, and if a is near \tt, we 
get a curve with a maximum value on each side of the median, symmetrical when 
a is \n. This distribution does occasionally arise in practice; an example is given 
by Udny Yule (1932). Fig. 2 is an example of a slightly asymmetrical curve of 
this type. 

(4) A general solution 

K may very well occupy a position within the bulk of the surface; this must 
happen when both of the variables have negative values If we can obtain an 
equation of the curve in this case, it should have a general application to all values 
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of k. Geary stated this problem but did not proceed to its solution, Fieller gave a 
general algebraical solution. 

We may consider K as lying on the circumference of the ellipse 


where h = kj<r a and the ordinate on the circumference of the ellipse is 


1 

2 nab 


e~W, 


In terms of the primary variables, 

1 (X* 2 rXY F 2 \ 

~ o- x cr v o%}' 

As before, the frequency curve of the angle 0 is given by the positive moment of the 
normal curve (4) about the ordinate at K. This moment may be considered in two 
parts, the moment arising from the portion of the curve without the ellipse (A), 
and the moment arising from the portion of the curve within the ellipse on which 
K lies (B). 

(A) For any normal curve the moment about the origin for that part of the 
curve beyond the ordinate at a given deviation her is 


which is equal to 


if 


xe~^ xlo) ‘dx. 


her 

y 0 cr 2 e-* h \ 


That is to say the moment is a function of the ordinate at the given deviation and 
of the standard deviation. Moreover the moment about this ordinate of the two 
equal tails of the curve is equal to the moment of these tails about the origin of the 
curve, so that in our case the required moment is 


e~ ih ' ab 

7t a 2 sin 2 (a + 6) + 6 2 cos 2 (a + 6)' 


( 8 ) 


(B) The length of a chord of the ellipse which passes through K and makes an 
angle of a + 6 with the major axis is 


2hab 


*J(a 2 sin 2 (a + 6) + b 2 cos 2 (a+ 6) ) 


J[ l ih*j (a 2 ei 


k sin# 


sin 2 (a + 0) + b 2 cos 2 (a + 6) ) 
&sin0 


(0) 


( 10 ) 


and if we make sin cl = y . ,, —g- —-—-g— 

T h ^{a- sin 2 (<x + d) + b i cos 2 (a + 0)) 

we may put (9) as 2haoa^(r ia+g) . The total area of the normal curve in this plane 
using (4) is 1 

\j{2n) j(a 2 sin 2 (a + 0) + b 2 cos 2 {a + 6]) 
so that the ordinate at its origin is 

e -t(ft sin 


e -i(A sln<«‘ 


277-06 
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and the area of the portion within the ellipse is 


civ. 


i , ,„fW‘W) r i v \ 2 

—-e-KftMnjS) eX p ---1 

nab jo L 2 l cr (a+«/ 

This is multiplied by h cos <f><r {a+0) , 

t rft cob ii 

__g~KA sin $)' ^h coa e~ lu> d«, 

which may be simplified iiito 

T Wpgw l 1 * c ° 8ff+ O (h coa# 


adding (8) and (12a), we have 




y 


ab 


77 a 2 sin 2 (a+d) + 6 2 cos 2 (a + d) 


( 11 ) 


( 12 ) 


+ y-g-g cos <Pf + iYTI 008 ^)® + • ‘ : ( 12 <*) 


1 + {h cos?!) 2 + z —q (6 cos (j> ) 4 

1.0 


+ ro 008 + rxu^ cos ^ )8 + • • j ( 13 > 


as the equation of the curve. This expression converges quite rapidly for values of 
h up to 3; thereafter convergence becomes very slow indeed. 

Reverting to (10), the value of sin$ may be put in this form 

a6ain# 

f(a 2 sin 2 a+6 2 cos 2 a) f(a 2 sin 2 (a+ d ) + 6 2 cos 2 " (a+ 0))' 

From this the following identities may be established: 

, a 2 sin a sin (a+19) -f 6 2 cos a cos (a+0) 

C08 ^ V(« 2 sin 2 a + 6 2 cos 2 ct)f[a 2 sin 2 (a + d) + 6 2 cos 2 (a + 0)) ’ 

dtj) _ ab 

dd a 2 sin 2 (a+0) + & 2 co8 2 (ot+0)’ 

<!> - tan -1 {(a/6) tan (a + 6)} - tan -1 {(a/6) tan a}. 

It should be noted that the .limit of the integral in (5 a) is h sin <j> and that the value 
of f in (16) makes the practical work of calculating a series of these limits very 
simple. If the distribution of 6 is still to be regarded as round the median at a, 
9 will have the limits as before, - (a 4- y) and n-(cc 4- y), and at these limits ip will 
be + \v. generally, however, 6 may be regarded as having a range of 7 T beginning 
at any value, in which case (j> will have the same range but with a different 
distribution. 

A geometrical construction to show the relationship between 8 and <p is given 
in Fig. 3, where OA and OB are equal to a and 6 respectively, and O'A B is a 
right-angled isosceles triangle on AB as hypotenuse. 


(10a) 

(14) 

(15) 

(16) 
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Here 


AP = 


asina 
sin OP A 


and 
so that 

by the same reasoning 

similarly 

therefore 


6 sin (for-a) 

~ sin OPB ’ 

JlP/PB =i (<*/6)tana; 

AP/PJB = tan^OT, 

AO'Q = tan -1 {(a/6) tan (a + 6)}\ 
PO'Q = 4>. 


This relationship between 6 and <j) shows that the solution consists essentially 
in referring an asymmetrical system to an equivalent system where the standard 
deviations are equal. 



If we now turn back to (6), this can be put in the form 


.. = 1 hab (a 2 sin a sin {a + 6) + b* cos a cos (a + 6)) _ nns 

y aJ(27t) (a 2 sin 2 a + 6 2 cos 2 a) i (a z sin 2 (a + <9) + & 2 cos 2 (a + 0)) 8 ’ ' 

so that we have the approximation of (6) to (13) 


e~ ih ’d(I> / fft «on <p \ 

Acos^e^ 008 ^'^—1+Aoos^et (ftoOB « i), J e,~l u> du\. (18) 

xpressi 

"Mil. 

dd\ 


e~*>‘ d<f> 

<](2n)dd 

It will be seen that the last expression may be written 
e -w d6 a-V 1 ' 

3E«■+• 


• h cos <f> ep h 003 ' 


% 0G 

)■ 


cos <ji 


er^'du . (19) 


The difference between the values of y given by the two equations (if we do not take 
into account the value of d<j>/dd) is given in the table below for different values 
of h and for <f> = \u and 0, showing that (6) must be a poor approximation to (13) 
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when h is less than 3 , but that beyond that figure the difference rapidly becomes 


negligible. 


h 

Difference between (6) and (13) J 

4>-\P 

0=0 

1 

0-193665 

0-066476 

2 

0-043084 

0-000776 

3 

0-003536 

0-000305 

4 

• 0-000107 

0-000005 


Fieller’s general solution of the problem is given in his formula (24) 
1 <r x a- v f(l-r 2 ) f 1 1 (& x y 5*\"| 

^ ^ ~ 7rcr|-2rOT 3 .cr 1 ( +t) 2 o-| exp L 21 -r 2 \<r 2 T cr a <r v crJ/J 


+ exp ,-w- 


{y-vx) 2 


2 CT* - %rv<J a <T v + V*0 r* 


i] 


tryitycx— 5o>)+txr»(rxg , t — yor x ) 

cr y (ry(r a - xcr v ) + vcr x (rx<r u - ya x ) MM (V'SwrHWit , u ,, 
n(cr 2 — 2rvcr x c r v 4- r>V 2 )* Jo 

v is the value of the ratio under consideration which may be put as tan (a + 8+8), 
and if we reduce Keller's formula with its symbols based on the cso-ordinates of 
the primary variables to the system with its symbols based on the principal axes, 
- we get the following identities: 

cr*~ 2 rv<r x cr v + v 2 <r 2 = (1 +i> 2 ) (a 3 sin 2 (a-f 0 ) + i 2 cos 3 (oc + 8)), 


1-/2 = (a 2 b 2 )l((r 2 <r 2 ), 

~ x y y 2 _ k 2 (a 2 Bin 2 a + fe 2 cos 2 a) 
r < 7 x o\ J a* trier* _ 


(y-vx) 2 = (1 + v 2 ) k 2 sin 2 8, 


a v (ry<r x - x<r y ) + v<r x (rx<r v - ycr x ) = kf(l + v 2 ) 

x (a 2 sin a sin (a + 6) + b 2 cosa cos (a + 0)), 
so that Fieller’s formula as a whole reduces to 


f(v) = I — 1 — + g~i« B in 0 « # hcosfi iVicob^ 
n 1 7rl + v 2 dT +e d07r(l + « 2 )J« 6 dU - 

Here d<f>jd9 = 1 + u 2 , so that the distribution ft(v) of v given by Fieller’s equation 
is equivalent to the distribution of 6 in the sum of ( 8 ) and ( 12 ). 

The curves arising from (13) are of very great variety of form. They are all 
limited, indeed they are more properly described as cyclical, the ordinates at the 
limits for Q, since <j> then equals + ^n, being both equal to 


e iht q 4 sin 2 a + 6 4 cos 2 a 
7T ah (a 2 sin 2 a + b 2 cos 2 a) ‘ 


(20) 
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That being so there is no necessity to regard the curve as beginning and ending at 
the limiting values for d\ in practice it would probably be taken to begin either 
where a + dis, zero or where the ratio under consideration has a value of - co, i.e, 
where a + S + 0 is — \tt. The unit deviation in both equations (6) and (13) is the 
radian; for practical work njn would probably be used which would require a 
. corresponding alteration in the equations. If we continue to regard the curve an 




Fig. 4. 1W ourvos from equations (13) to illustrate the change in form os k increases 
from 0 to 3. The curves commonoe where a-\-0~0. Constants : a = 2, b = ] , a s= 45°. 


drawn with the median at a, we find that when h is zero it is U-shaped except when 
a is small, and is more or less asymmetrical depending on the value of a. With 
values of h about 3 it is possible to produce curves of a highly asymmetrical type; 
as h increases beyond 3 the curve reverts to the bell-shape and the limits recede 
from the bulk of the curve until, as we saw, when h is infinite the curve becomes 
the normal curve and is unlimited. Fig. 4 is an example of the development of the 
curve as h increases from zero to 3. 
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(5) Integration 

The value of a frequency is the integration of y in (13) with respect to the 
variable 6 and, making use of (15), this is 
p-ih 1 ri i 

Q($) - —p J T 1 + (h cos <j>Y+■ p-g (h cos 0 ) 4 

+yj 5 ( h cos< ^ 8+ rro (h008 ^ + "') d ^' (21} 

An expression for this integral may be obtained by converting the series in 
powers of cos Q into a series of cosines of multiples of </■>, integration then gives a 
series in sines of multiples of <f>) but the functions of h which are the factors in the 
series are very complicated and do not lend themselves to easy computation so 
that it is better to reconvert into a series in powers of cos <f>, and (putting m for 
lh 2 ) this is 

Q$) = _e-«) +1(1 e~ m rn) eos 2 <£ 

+ ~ |l - e~ m - e~ m m— e" w ~j-j cos 1 ^ 

2,4.6/ „ ' M m 2 m*\ , \ , 0o . 

+ 3TT\ 1 ” e m ~ e gT ” e ' 3 !) 008 0 ■• | > ( 22 ) 

where the occurrence of the terms of Poisson’s series is very interesting. 

(6) Conclusion 

The frequency distribution of the quotient of two normal variables may, then, 
give rise to most of the forms which are met with in statistical work. It is not, 
however, suggested that such statistical distributions always arise in this precise 
fashion; at the same time, from geometrical considerations, it seems likely that 
the product of two variables would produce a similar set of curves. It is not 
impossible that a large number of primary variables might group themselves into 
two secondary variables of approximately normal distribution and that the final 
distribution is some function involving either the quotient or the product of these 
variables. However that may be, the fitting of this curve to any given distribution 
appears to present many difficulties and is quite beyond the scope of this paper. 

(7) Example 

The following example illustrates the practical use of (j) in the application of 
Geary’s approximation. Some of the difficulties of childbirth are undoubtedly due 
to a disproportion between the size of the foetal head and the size of the bony 
opening through which it has to pass, the brim of the pelvis. This difficulty 
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becomes absolute, i.e, demands caesarian section, in about 1 % of all cases (in 
Guy’s Hospital (1937) ten cases were dealt with by caesarian section on account 
of disproportion out of 990 pregnancies in a fair sample of the population); it 
would be well to know for the purposes of prognosis the percentage ratio between 
the size of the passenger and the size of the passage beyond which spontaneous 
delivery becomes impossible. The foetal head, as it passes, is roughly circular in 
section and the area of the maximum section may be calculated from the biparietal 
diameter; the figures for this diameter are taken from a series of 1010 measure¬ 
ments by Ince (1939). It has been shown that the area of the pelvic opening is given 
to a close enough approximation by the area of the ellipse on its antero-posterior 
(conjugate) and transverse diameters; the figures for these are taken from a series 
of 360 measurements made by radiology (Nicholson, 1938). It might be well to 
add that the radiological method used (Nicholson, 1936) has a probable error of 
accuracy as low as a millimetre. 

These figures are 



Biparietal 

Conjugate 

Transverse 

Mean, (mm.) 




Standard deviation (mm.) 




Coefficient of variation 





The distribution of these variables is normal, and the two latter are independent, 
so that we can estimate the following figures for the two areas: 



Uoetal head 
(?/) 

Maternal pelvis 
(*) 

Mean (sq. cm.) 

05-8 

12M) 

Standard deviation (sq. cm.) 

5-8 

12-0 

Coefficient of variation 

8-8 

. 

10-7 


The distribution of these variables is not, theoretically, normal but the error 
from assuming normality will be negligible; we shall also assume that they are 
independent, an assumption which is apparently not unreasonable. We may now 
calculate the following constants for the frequency curve for the ratio: 

a = 12’9; 6 = 5-8; a = tan' 1 ( YjX ) = tan' 1 (65-8/121-0) = 28° 32'-25; 
k = 137-703; «r s = 9-357; h = 14-720. 
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Prom the tables of the normal curve we get the deviate for a frequency of 
1 % as 2-3263, and applying (5a), (10), and (16), we have 

hm<j )» 2-3263, 
sin^ - 0-15804, 

^ = 9° 6', 

tan -1 {{^) tan a} ~ 50° 25', 
tan" 1 {(a/5) tan (#+$)} = 59° 31', 

(a/6)tan(a+0) = 1-69879, 
tan(a+0)« 0-764. 

The required percentage ratio is then 76-4 %. Using the usual approximation 
(Y/l){l4(crjJ) 2 }, the mean of the ratio would be 55-5%, and its standard 
deviation ( Y /l) {{<r x jlf +(cr y /F) 3 }, 7-5 %; if wehadassumedthatthedistribution 
of the ratio was normal, we should have got a result of 72-9 %; so that, even when k 
is quite high, the distribution of the tails of the curve may be far from normal. 

The value of the figure 76-4 from the point of view of prognosis is that we can 
now predict that unless an event has occurred, the chances against which are 
99 to 1, a pelvis with an area of llOsq. cm. can pass 99-9 % of foetal heads, that 
a pelvis of 100 sq. cm. can pass 97 % that a pelvis of 90 sq. cm. can pass 7 0 %, but 
that a pelvis of 80 sq. cm, can pass no more than 21 % of foetal heads. 
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THE STATISTICAL SIGNIFICANCE OF 
CANONICAL CORRELATIONS 


By M, S. BARTLETT 

. 


1. In an important paper published in this Journal, Hotelling (1936) has 
shown that the generalized variance matrix* N 



of a vector variate x which has been partitioned into two parts x x and x 2 with, 
say, q and p components, can, by appropriate linear transformations L 1 X 1 and 
L 2 x 2 of x x and x 2 , be thrown into the canonical form 




R is a rectangular matrix which is zero except for a leading diagonal of squares 
A‘f of canonical correlations. 

Similar operations on the estimated matrix variance V give rise to estimated 
canonical correlations l { , which measure the correlations between estimates of the 
linear functions L x x x and L 2 x 2 . While Hotelling has given asymptotic standard 
errors for the coefficients l {! it is known that the significance of these correlations, 
as in the simple case p = 1, is more generally to be interpreted as the significance 
of the regression relations of x 2 with x v the validity of any exact tests of signi¬ 
ficance depending on the supposition that the dependent variate x 2 , apart from 
its linear dependence on x x , is normal. 

Special cases of the simultaneous distribution of the correlations l t , when X 2 
and Xj are unrelated, have been considered by Hotelling (1936) and Girsohick 
(1939), but an important theoretical advance is represented by the derivation of 
the distribution (under the same conditions) for any values of p and q (Fishor, 
1939; Hsu, 1939). It will be shown that this distribution makes available further 
possible tests; and since the problem of the most appropriate tests of significance 


* A matrix is usually denoted by a capital letter, and if it baa both a population and aarnplo 
value, the population value ia given in heavier type (of. Bartlett, 1939). The transpose of any 
matrix A is denoted by A', A matrix with only one column is a vector, and ia often denoted by a 
small letter. To avoid confusion, a 1 vector variate X is written in heavier type throughout, to distin¬ 
guish it from a single variate x, Ifx is measured from its population mean, the variance matrix V 
is the average value ofxx'. In practice we lose one or more degrees of freedom by measuring x 
from sample or regression means, but without loss of generality we shall suppose that our sample 
consists of measurements ofx with v degrees of freedom. 
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has not always been considered very adequately by other writers, it is also the 
purpose of this paper to explain the logical relation of these further teats to tests 
of significance previously available,* 


2. For the case p the distribution of l { , when these roots are arranged in 
order of magnitude, is given by 

F(im . ll)dl\dl\...dl%, 

where F - 0 ft ft (*S“9)1 


and 


G 




r\{y-i + \) 


-i+ I) 


( 1 ) 


-.i + l)rtiv-q-i+l)ty{ V - 
For p > q, we need only reverse the roles of x x and x a , 

A criterion which is useful in detecting the simultaneous departure of several 
roots A^ from zero is the product 


p 

na- 

1=1 


•l|) = A, say.f 


Whenp = 1, the distribution of A is equivalent to that of If, ami the distribution 
in (1) can be transformed if required into Fisher’s z-diatribution. When p — 2, it 
was found by Wilks that a similar distribution exists for *JA. For p > 2, no exact 
test is at present available, but the formula 

X 2 = - -1(3>+3 +1)} log/l. 

with pq degrees of freedom, gives a good approximate test (Bartlett, 1938). 

If the roots Af, ...,A| are zero, we are, however, including in A irrelevant 
degrees of freedom which might possibly obscure the significance of Af. For any 
test on Af by itself, we have little choice but to consider Zf, though we do not really 
know whether Zf is the root corresponding to Af or not. The probability distribu¬ 
tion}: p{l\) is theoretically obtainable from (1), and hence also 0-05 or 0-01 levels 


* The distribution, of obtained by Fisher and Hsu has also been obtained by Roy (1039), 
though this writer was concerned with the different problem of comparing the dispersion, in two 
multivariate normal samples. For a single variate, testing the significance of a sum of squares 
separated off from the total sum of squares by a multiple correlation or regression formula is 
equivalent to testing the ratio of two variances, a criterion also employed to test the equality of 
two population variances. Roy has prop'osed generalizing the latter problem along lines which give 
rise to the same distribution problem solved by Fisher and Hsu, but while ho has independently 
obtained the same general distribution, the need for some care in the choice of tests in multivariate 
analysis is even more evident in the problem with which Roy was concerned. It is obvious, for 
example, that the p roots which Roy considered cannot represent all the possible differences among 
the bp(p+1) variance and oovariance parameters between two p -variate normal samples, and some 
explanation of their interpretation seems required. 

t This criterion has been proposed by Wilks (1932), Bartlett (1934), and Hotelling (1936), the 
laBt-named denoting it by z. 

} The probability of a random variable having a particular value x is denoted by p(x). If the 
variable has a continuous range of values, p(x) denotes the probability of the variable falling in the 
interval x,x+ dx. The corresponding notations x j y and p(x | y) are used when the variable is only 
being considered for a fixed value y of another variable. The probability symbol p is not of course to 
be confused with the number p of components in the vector variate x 2 . 
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of significance of l\ for specified values of v, p and q. The tabulation of these levels 
would be useful, but would also be a task of some magnitude, and it is therefore 
worth noting that owing to the problem of identification, the largest root l\ is not 
a sufficient statistic for Af, and p(l x ) has no unique relevance. If we consider, 
instead, the distribution of l\ forgiven values of l\...Ip, we have .corresponding 
to the probability relation 

the probability density relation 

where the function f x , apart from the constant term/ 2 , is determined at once from 
the function F. 

In the logical situation we are postulating where Af, but not the other roots, is 
different from zero, it is not evident which distribution, p(l\) or p(lf \l\, 
provides the more powerful test, owing to the absence of sufficiency properties, 
and it is of some interest to consider in detail another problem which is trivial in 
itself, but serves to illustrate the principles involved. 


3. Suppose we have a pair of variates x x and x 2 both independently following 
a rectangular distribution p{x) = dx> {0 ^ x<1) . 


One variate (unspecified) is then shifted a distance a, so that it follows the 

distribution . . , . . . ^. , 

p(x) — dx, (u^x^l+a). 


If x x and x 2 denote the variates in order of magnitude, we shall detect the shift 
a from the larger value, x x> if a is large enough. To compare the value of p( x i) and 
p(x 1 1 x 2 ), we note first of all that when a — 0, 

p(x x ) = 2x x dx x , (0 ^ x x =% 1) 

/ i , dx-, 

P(*i I *») = -y—-- (*» < *i < 1) 

For the significance level e, p(x x ) gives a critical value % x = 1 - e), while p(x x 1 x 2 ) 

gives x x = 1 - e(l - x 2 ). If a is different from zero, a peculiar feature (analogous to 
the canonical correlation problem) is that the larger observation x x may or may 
not be associated with a. For p( x x ) we find 

(2x x —a)dx x , (a ^x x ^l) 

dx i. (l<Xj^l~f*cc) 

For p(x x | x 2 ), we have 

2 dx, 

a + 2(1 — x 2 )’ 
dx x 

a + 2(1 — x 2 )‘ 


(lcc^l + a) 
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This is provided x 2 ^ a; for x 2 < cc, we have 

p(x x 1 = dx v {a ;£ x x ^ 1 + cc) 

Using the terminology of Neyman and Pearson, we shall denote the power of the 
test derived from x x hy P; and for that from % | x x , by P'. Then 

rv a-6) 

1-P=I (2x 1 -a)dx l 
= \-e-af(l~e). 

For 1 -P', we have first of all, for given x 2 , an integral 


n- 

J Xt 


which gives 


l-e(l—x 2 )-a, 

2(1 —e) (1 —X;,) 


J>(*i i *a). 
(a: 2 <a) 


a+2(l-x„) 

Since p{x 2 1 cc) is given by 

(a-t-2(l— x 2 )|dx 2 , (u.^x 2 ^l) 

dx 2 , (0 < x 2 < a) 

we finally obtain, after averaging over x 2 , 

1-P' = (l-e)(l - a) - Ja 2 e. 

Before comparing P with P', we may remember that we do not expect either x x 
or x\ (x 2 to provide the most powerful test obtainable. Theoretically we can see 
what this test would be by considering the ratio 

p{x v x x j cc)jp(x x , x 2 j 0) = X a , 

say, though since the value of X a is indeterminate unless the true value of a is 
specified, it should be realized that X a does not provide us with any actual test, 
only with a theoretical upper limit for P or P'. 

The criterion X a has the distribution 

X x — oo 1 0 

p(XJa)=:a (1-a f a(l-a) 

p(X a j 0) = 0 (1-a) 2 2a-a 2 

For (1 - a) 2 ^ e, we shall allow the value X a ~ 1 to be significant in ej{ 1 - a) 3 of the 
times that the value 1 occurs; if (1 — a) 2 < e, we allow X a — 0 to be significant in the 
traction e-jl-af 

2 a-a 2 

of the times that X a = 0 occurs. The power P" of a test that could be based on X a 
isthen a+e, [(1 —a) 2 5se], 

«■+ 1 C -«)’ ■+ - «), HI - »)H e). 
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Comparative values of P, P' and P" are given in Table 1 for e = 0-05 and 0-10. 

Table 1 


e a 

0 

0-1 

0-2 

0-4 

0-6 

0-8 

0-9 

( p 

0-0500 

0-1475 

0-2449 

0-4399 

0-6348 

0-8298 

0-9272 

0-061 P' 

0-0500 

0-1453 

0-2410 

0-4340 

0-6290 

0-8260 

0-9253 

!p" 

0-0500 

0-1500 

0-2500 

0-4500 

0-6500 

0-8464 

0-9373 

cp 

0-1000 

0-1949 

0-2897 

0-4795 

0-6692 

0-8590 

0-9538 

0-10j P' 

0-1000 

0-1905 

0-2820 

0-4680 

0-6580 

0-8520 

0-9505 

[P" 

0-1000 

0-2000 

0-3000 

0-5000 

0-7000 

0-8785 

0-9603 


It will be seen that p(x x ) provides a test in this problem rather more powerful 
than 'p{%\ | x 2 ), but that the latter is quite effective. We cannot of course transfer 
this result to our main problem, but it is clear that p(l\ 1 1) may justifiably 

be considered, at least until the distribution p(l\) has been tabulated. 

4. Returning then to this distribution, we may examine one or two special 
cases before formally noting the significance level of l\ in general. It has been 
shown by Fisher and Hsu that for v large, the distribution of l\, l\,I £ tends to 

0(rn(, ra|,m|) dm\dm\... dm 2 v , 

where m\ = \vll 0 = O' fl ((raf)* (a- * _1) e _m < a (m\ — m?)j, 

i=i \ j~i+1 I 

and llC'=fl{ri(q-i + l)rtip-i + l)}. (2) 

i= 1 

For the particular case p = 2, q — 3, the distribution of m\ j is 

{m\ — m\) dm\, 

which is a function simply of If alternatively we consider the distribution 

p(mf), we obtain 2 e - m i' J {e-'V - (1 - ml)}dm\, 

the 0-05 significance level for which is 5-37. From p{m\ | m\) this value of 5'37 
corresponds to a level 0-030 if m\ = 0, to 0-045 if m\ is equal to its expected value 
0-50, and to 0-05 when m\ reaches the value 0-63. These results merely illustrate 
how the significance level of mf depends on which distribution is being used. 
For the case p — 3, q = 4, the significance level for m| can be written 

where u = v = ml-ml The level of significance thus depends mainly on 

u, as we should expect, but the effect of v is not negligible. The factor multiplying 
the exponential varies, for example, when u = 4, from 10£ for v = 1 to 13 for 
v = 0. 
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The general expression for the significance level for m\ is 

f” n 

J m , 1 *=»__ 

f” (mf) 1(!MS “ 1) e~ m i* n (mf - mf) dm? 

J mi’ i"8 

or, if we write r x («) = j txf-~ x e~ x ix, 

J <T 

by 

r m ,(i[p+s+i])-{l4r Wi 4[p+g-i])+fs , ? 

1 _ U~2 J _ vi-g J~i+l _ l ____ 

r w m + ? +1])- (S *W(*[p + 2 - !]) + (A, I ™< m ?| + 2-33) 

U°»2 / Vi**” , 

/ A 


For the more general case of finite v, we haVe similarly for l\, 
f 1 a-j.-D (i _ [ll - ii) dl\ 

J k' _1^2_ } 

f 1 (1 - 1\)W-<i-p-V n Ql-lfl dl\ 

J V i-2 

or, if B x {a, ft) - j* oc?- 1 (l-xy~ l dx, 

J X 

by 

+ 2+1], ftv • - P ~ 2 +1]) - ( i JjU t . ft [p + q - l], iO - V ~ <1 +1]) + > * • 

U—S 


The dependence of results (3) and. (4) not only on v, p and q, hut also on the 
particular values of Z|,..., makes it impraotioable to tabulate the 0- 05 or other 
levels of significance; but it is not difficult in any instance to find tbe exact level 
from (3) or (4), using the published tables of Fjia) or B x {a, /?).* 

It must be recognized that if the second root A| is also 1 different from zero,the 
distribution of l\ for given l\ is quite irrelevant, but except possibly when p is 
rather large, it is probable that two or more non-zero roots would be detected by 
the A criterion, and the testing of Af alone by means of l\ (a test which is still not 
completely efficient) would not arise. 


5. Directly we have established the existence of at least one root Af, we may 
always proceed to eliminate this correlation A^ and the corresponding pair of 
canonical variates; and analyse the remainder. The theory of eliminating from 
x 2 a set of specified variates represented, say, by the vector variate x 0 has been 

* Tables of the Incomplete r-function, ed. K. Pearson (1922, His Majesty's Stationery Offioe, 
London); Tables of the Incomplete Beta-function, ed, K. Pearson (1934, Biometrika Offioe, Univer¬ 
sity College, London). 
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indicated, by Bartlett (1939).* As a particular case, x 0 may be a hypothetical set 
of r canonical variates of x 2 , and the criterion A{v—r,p~r,q) for the remaining 
variate x a>0 , in place of the original criterion A{v,p, q) for x 2 , would test the good¬ 
ness of fit of the hypothetical vector canonical variate x 0 . In the case q = 1, we 
have the goodness of fit of a hypothetical discriminant function, the problem of 
which was first raised by Fisher (1938). 

It has, however, also been pointed out (Bartlett, 1938, p. 39) that if the canoni¬ 
cal vector variate x 0 has been estimated from the data, the symmetrical relation 
between x 2 and x x will imply that each has only p-r and q — r independent 
components remaining, the y 2 approximation for the criterion 

■4'= n (i-S) 

i=r+l 

being -{(v-r)~ £[(p - r) + [q - r) +1]} log A! = - {v - \{p + q +1)} log A!, 

with (p—r) {q — r) degrees of freedom. It was stressed that this reduction of the 
degrees of freedom essentially depends on the existence of non-zero roots 
Af, ..., A-!, so that the vector variate x„ is well-determined, and any effect of 
selection of l \,..., P r from l \,..., Z 2 can he neglected. Under the same conditions, 
we may approximately use the tests known for p = 1 or 2, for the criterion 
A'(v-r,p — r,q-r), when p-r = 1 or 2. 

6. To demonstrate the reduction in degrees of freedom in the case r = 1, 
consider the case when v is large, and 

- vlog £ vZf-> y a . 

If i4f = Q it the determinantal equation for 9 i is of the form 

\A-ev\ = o, 

where V denotes the variance matrix of x a , and A is a matrix of the sums of squares 
and products among the p variates of x 2 for that portion of the sample separated 
off in terms of the independent vector variate x v Without loss of generality, we 
shall suppose that V = 1. 

Regarding the v observations for any variate as a vector with v orthogonal 
components, let us now add to the chance variation of the first variate of x 2 a part 
dependent on each of the q (orthogonal) variates of x v For each variate of x 1 , 
the length of the vector representing the first variate of x 2 will then receive an 
addition X k , say, (k=l...q) > which will be of order <Jv. Partitioning off the first 
variate of x 2 , we obtain, as our new equation for 0, 



i + 2Xx-±X -f* XX^ - 

-6 

dij XXjX 

a^+Z’xiX 

Uy-d 


* See equation (2.8) of the paper cited, and the immediately preceding equation. 





36 The statistical significance of canonical correlations 

The summation sign is for the q degrees of freedom of x lt and % ~ where 
aj 1( ,,,,x p are the p variates of x a . Solving the equation for the largest root, we have 

d, = ip + 2Z Xl x +%, + ' 


If we neglect the last term, the sum of the remaining roots becomes 


v P 

{■j» 1 2 


%■ 


(ZxiX)*} 


£X* 


which is a y 2 with (p—1) (g— 1) degrees of freedom. 


7. To illustrate the use of this test we may consider the data from Kelley 
quoted by Hotelling (1936), these consisting of correlations among tests in (1) read¬ 
ing speed, (2) reading power, (3) arithmetic speed and (4) arithmetic power, the 
sample being one of 140 seventh-grade school children. Hotelling, investigating 
the relation of arithmetical with reading abilities, found canonical correlations 

li* 0-3045, Z a = 0-0688. 

Since v = 139, p = 2, q = 2, the first correlation gives a contribution to y 2 of 

- {139 -1(2-+ 2+1)} log (1 - 0-3945*) ® 23-09. 

Similarly the contribution from is 0-64. The x 2 analysis is consequently sum¬ 
marized as in Table 2. 

Table 2 



D.T. 

X s 

k - 

3 

23-09 

k 

1 

0-64 

Total 

4 

23-73 


It is evident at once, as Hotelling concluded from other tests, that there is- a 
significant relation between arithmetical and reading abilities, which arises 
entirely from the first canonical correlation. 
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ON THE LIMITING DISTRIBUTION OF THE CANONICAL 

CORRELATIONS 

By P. L. HSU 

1. The purpose of this paper is to deduce the limiting distribution of Hotel¬ 
ling’s canonical correlations* under the most general assumption on the popula¬ 
tion canonical correlations, The result is stated in the theorem at the end of this 
paper. 

The method employed here is essentially the same as that by which we derived 
(Hsu, 1940) the limiting distribution of Fisher’s discriminating components. 
In what follows, steps in the derivation are given while strictly rigorous reasoning 
is left out. The latter may be found in the author’s 1940 paper. 

The parent distribution is represented by the density 

const.expj - J ( 2 <%o^ + 2 2 2 fi t9 b 1g + 2 (jKg), .(1) 

1 Vt,j=l i=l (=1 1 /) 


where 


n n n 

a ij — 2 b l0 = 2 c ph ~ 2 yplVlil" 

<=■1 lm 1 tail 


.( 2 ) 


By virtue of Hotelling’s reduction, the matrix of variances and covariances is 
taken to he 


Hi 


a ip Pn Pip " 1 




&pp Ppi •*• P 


Pii ••• Ppi Tii 

h<i ••• Pm Yq i 


pa 

Yiq 


Yqq 


1 .. 

. 0 p[ ... 

0 . 

• 0 


0 .. 

.10... 

p'v • 

.. 0 


p'l • 

.01... 

0 . 

,. 0 

,. '.(3) 

0 . 

.. p' v o .. 

1 . 

.. 0 


0 . 

..0 0.. 

0 

.. 1 


teal correlations. The sample canonical 


* ' • if XX 

correlations, r v ,.., r p) are the positive roots of the equation') 


m n .. 

* m ip 

bn 

blq 

ra p\ ■ 

.. ra pp 

bpi 

... b m 

hi • 

bpi 

rc n 

n M 

blq . 

" bpg 

rG qi 

- rc qv 


= 0 . 


•(4) 


* Hotelling (1936). For further work on the distribution of the canonical correlations, 
see Madow (1936), Girschick (1939) and Hsu (1939). 

f We use r in (4) instead of -r as in Hotelling’s original definition because it is 
known that the non-vanishing roots of (4) form pairs each of which have the same 
absolute value but opposite signs. 
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We set Pi — ... = p' H = pi, 

Pftl+l — • • • = Pp-x+pi ~ Pto 

P/ii+...+fiv- 1 +l ~ ' • • s= Pe — Pv> 

Ps +1 = • • • = Pp ~ 

p!>p 2 >... >p v > 0, 

Ti^r^ 

and proceed to find the limiting distribution of r lt ..., r p as n co. 

2. Lemma 1. We have the identity 

£ S = I S |. | P- OS _1 R |. .(9) 

This results from the identity 


P 0 


I 0 


P-QS^R 0 

R S 


-SbR I 


O S 


on taking determinants on both sides. 


( 6 ) 

( 6 ) 

(7) 

( 8 ) 


Lemma 2. Let 


% — n+^jnu iit 

*»» 

s 

4 

1! 

<r 



b u = npi+Jnv u , 

b io = ^nv io 


.(10) 

c oa = n + Jnw ga , 

c gh = 4nw gU 

(y 4= A),J 


(i,j = 1,... 

,p\ g,h - 1,... 

>«)• 



The distribution of the u‘s, v’s and w’s approach that of \{p + q){p + q + 1) normal 
variates whose means are zero and whose second moments are specified in the following 
statements : ' 

(i) any v or w which lias at least one suffix number > p is uncorrelated with all 
the others ; 

(ii) any member of one of the sets (u u ,v u , w u ), (i = l, ...,p), (u^v^v^w^), 
(i,j = l, ...,p; i =f=i) is uncorrelated with all the members of all the other sets ; 

(iii) for i,j— l,...,pwe have 


- 2, 

<*K) = i + P ?, 

*<««*«) = 2 Pi 2 . 

= <^(*%%) = 2 p\, 

${u%) = £{v\f) = /(«$,) = 1 (i + J), 
s (VijWii) .= 4%%) = p^p* (i +J), 

= p) {i +j). 


(11) 
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This follows immediately from the well-known central limit theorem * The 
second moments may easily be computed by virtue of (3). 

Corollary. Under the assumption (6) any u,v or w which has al least one suffix 
number >sis uncorrelated with all the others, and ^(v} { ) - 1 for i = s +1,..., p. 
This results from (11) on putting p\ = 0 for i = s+ l,...,p. 

3. We may now find the limiting distribution of r, +l ,..., r p , the p~s smallest 
correlations. 

We substitute (10) in (4) and then divide each element by n, There results 
the equation 


r 



jn 

jn 

IV 

a jn 

p[Jr fn • 

V 
a jn 

V+i 

jn 

V 

a/» 



n s,s+l 

jn 

- > 

% 

jn 

r! 4- 

V+i 

a/» 

V 

• 7^ 

jn 

™ 8+ u 
••• jn 

h u T) 

A Jn 

Vu 
a jn 

Vm 

a/w 

Vu+i 

jn 

Vu 

”7» 

ru pl 

jn 

IV 

jn 

«W 

a/w 



v 

a/ W 

V±i 

7^ 

V. 

7« 

p[+~r 

jn 

V 

jn 

Vu 

jn 

v 

jn 



«V+l 

7^ 

wu 

” 

hi 

jn 

•" Pa 4" ~T~ 
jn 

Vu 
a Jn 

V 

jn 

IV. 

7» 


v 

a/w 

rwv, 

~7»r 

V+l 

jn 

V+1 
a jn 

Vu+1 

jn 

V+i 

jn 

«Vw 

jn 

jn 


7» 


hi V Vu 

jn 



tw qi rw QS 

jn jn 



jn 



.( 12 ) 


The equation (12) has p—s roots which are o(l) for large n. To evaluate these, we 
substitute n ^ for / in (12), delete the common factor from the rows 


s+ l,s + 2 , ...,p,p+s+l,y+« + 2 > ... ,p+q , 


* Cf. Cramer (1937), pp. 113-14. 
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0 ... 

0 

0 

... 0 

P'l 

... 0 

0 

... 0 

0 ... 

. 0 

0 

... 0 

0 

••• p's 

0 

... 0 

0 ... 

0 

~n 

... 0 

Vi.1 

• • ■ v a-t-l,a 

v s+l,a+l 

- 

0 ... 

0 

0 

... -rt 

V pl 

• • • 

v p#+l 

... V vq 

p'l 

0 

0 

... 0 

0 

... 0 

0 

... 0 

0 ... 

Ps 

0 

... 0 

0 

... 0 


... 0 

V U+1 

Ls.a+l 

w s+l,a+l 

. •.. v vM1 

0 

... 0 

-7) 

... 0 

v lQ ... 

V SQ 

%+i,a 

• ■ • v pg 

0 

... 0 

0 

... -7/ 


0, 


i.e. 

-7j .. 

0 

Li+l.a+1 • 1 

■ • v s+l,g 


0 

>. -v 

v p,a+l 

“ V pq 


l, S+1.8+l • 1 

■ V PJ+1 

-v - 

0 

' 

v s+x,a 

V PQ 

0 . 

• -y 


= 0. 


.(13) 


By (9) the left-hand side of (13) is equal to 


(~v) a - p 


d, 


'a+lj+l 




*+1,8 




... 4 2 -9? 2 


where 


dij = 23 (i,j = s + l,...,£>). 

ff=s+l 


.(14) 

■(16) 


Let £' +1 , ..., in descending order of magnitude, be the latent roots of the 
matrix ||jj. Then the p-s roots of (12) which are o(l) for large n are 

n-KP+oin-*) {i=s+ 

If we define £ s+1 , by putting 


U = (i = s + .(16) 

then the £’s have the same hmiting distribution as the £”s. Hence the limiting 
distribution of the may be derived as the distribution of the latent roots of 
II d i} ||, in which the v’b are regarded as having a distribution which is the limiting. 
distribution described in Lemma 2. By virtue of the Corollary this is the dis¬ 
tribution of (p -s){q- s) mutually independent normal variates with zero mean 
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and unit standard deviation. Therefore* the limiting distribution of the £ { = nr\ 
has the density function 


2-to-M^-a) ~8~i +,iy n n (&-&) 


/ v \ m-p-o / 

p \ 

n Si) exp 

— \ S £i) 




(17) 


<*)>£, 


'«+! = 




The transformation ^ gives the following density for the limiting dis¬ 
tribution of the r t (i-s 4- 1 ,..., p)‘- 


fi{y»+i>--‘>Vp) = 2^-«P“»)te-*)TrK p -*>(n + 

p p \ / p \Q—P f V \ 

n n ( n vt) <* P -* s ??),.< 18 ) 


»> i | 1+x > ...>^>0 . 

4. We now proceed to find the limiting distribution of r y , ...,r,. By virtue 
of (9) we may write the left-hand side of (4) as 

rs-* j C j. | r a A - BG -1 B' j, 


1 «11 • ■ ■ a lp 

W 

11 

... b lt 

O 

II 

c u c i<? 


• 


1 ®pl ••• ®pp 




% ••• c w 


where 

A = 


Hence, if we set &i = r i (» = 1, ■■■,$), 

the B i are the roots of the equation 

j BCHB' -0A j = 0. 

Substituting (10) in (21) and dividing each element by n, we get 
|(A +n-»V) (I+n-*W)-» (A'+»-*V') - 0(1+»-»U) j * 0, 

where 

U 


u n .. 

• u ip 

v 


... % 


w = 



i V — 



• 

1 M pi • 

• u pj> 


V 

••• 

II 




A - 

Pi 

... 0 

0 

... 0 




0 

- Pp 

0 

... 0 


% .. 

W *L 

w n •• 

w m 


■ (19) 
.( 20 ) 

.( 21 ) 

• ( 22 ) 

• (23) 

..( 24 ) 


* Hsu (1939), pp. 266-7. 
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Neglecting higher powers of we write I — n~l W for (I + n~* W) _1 and carry 
out the matrix multiplication to the term with as a factor. There results the 
equation 

|AA' + w~*(VA' + AV' —AWA')-0(I + »-*U)| = 0, .(25) 

i.e. 

p[ 2 -0+n~ i (2p' 1 v n ~ p[ 2 w n - 0u n ) M~ l (p( v n +pj> v 12 - p' x p' 2 w l2 - f)u n ) ... 

n~ l (p[ v pl + p' p v lp - p[p' p w lp - 0u lp ) 

n~ i {PiV 2 i+p' 2 v 12 - p^p'zviu-duu) p' 2 z -6p' 2 2 w 22 -du 22 ) ... 

W _l (Pi V p2 + Pp V 2p - PiP'p W 2P ~ 0U tp) 


^■(P'l'I-'pl + PpHl, - P'lPp Wlp -0u lp ) n-l(p' 2 v p2 + P p v Zp - P 2 ppW 2 P - 0u ip ) ... 

p p 2 -d + n-l(2p' p v pp -Pp 2 w pp - Gu pp ) 

= 0. .(26) 

On account of (5) there are p 1 roots of (26) which are p\ +o( 1) for large n. To 
evaluate these we substitute p\ + •nr' i p x t for 6 in (26). Since the first (i x of the 
p "s are equal to p lt there will be a common factor in each of the first /i x rows. 
After deleting this factor and then letting n -* oo, we get the equation 

2v n -p x (u n +w n )-^ «i 2 +v 2 ,-Pi(% 2 +w 12 ) ... 

% 1 + Vi-Pi(«i/‘ I + w 1/ J 

«12 + «2l-Pl( M 12 + W r J 2 u 22 — p 1 (« 22 + w zl ) — £ 


^ l + %l-Pl(u ln +W 1 J V 2/ll + V pi 2-p 1 (U2 /il + W 2pi ) ... 

~ P*( U f<\tn w n ft) ~ ^ 


= 0, .(27) 

Le - z n~£ ■■■ *i„ x 

. =0, .(28) 

z /ni ■ • • z l i ll d l ~ £ 

where Z a~ v ii + Vji-Pi( u ij + W ii) . (29) 


Let £i> ••■■Ci fc ^ e roots > i n descending order of magnitude, of (28). Then the 
Pi roots of (26) which are pf + o(l) for large n are 

Pi + »"V >1 Ci + o(n-i) (i = 1,.... Pi). 

If we define £ 2 ,..., £ /(i by putting 

d i = Pi + n-ip^ (i = l,...,p x ), .(30) 

then the £’s have the same limiting distribution as the £”s. Hence the limiting 
distribution of the £ f may be derived as the distribution of the latent roots of 
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the matrix'll % ||, in which the it's, v’b and vj’b are regarded as having a distribution 
which is the limiting distribution described in Lemma 2. 

Now, if all the ids, v’s and to’s are normal variates with aero mean, so are the 
z’s. Owing to the fact (ii) of Lemma 2 all the g’a are uncorrelated. Using the 
formulae (11) to calculate the variances of the z's, we easily obtain 

<?(«?,) = 4(1-p5)*, = 2(1 -/>*)* (i+i), .(31) 

(hi - 

Hence z u =2(1- p\) t u , z {j = *J 2(1- p\) % (i =hj), .(32) 

(hi = l.—./h). 

where the <’s are mutually independent normal variates with zero mean and unit 
standard deviation. 

Setting £ = 2(1— p\)ij in (28), we.get, by (32), 


ki~V 

2-‘t ia 

2_ V 



2-**a 

t n —rj 

1 • 

= 0. 

.(33) 

2%,. 


•• W 1 ? 




Let iji,.,., r\ n be the roots of (33) in descending order of magnitude. 


The density function 

(2w)-W*+» exp { - £(i 2 u + ... + t \ m +&+...+.(34) 

is equal to (2jr) - b , i(ft+ 1 >exp | ^ S ?/) 2 j, .(35) 


which is a function of the latent roots only. Hence* the distribution of the latent 
roots has the density 

= iVT 1 !fi fi Wi-vvW p(-is^ 2 ),.(36) 

\i=“l / U—1 I \ i~»l / 

The density function (36) represents the limiting distribution of ->/[, where 


0 { = pl+2n-tp 1 (l-p2)ijl (t= 1,.,.,/q). .(37) 

Hence r 4 - <9| - /h+?r‘(l-/>!b)+o(«r*) (i= 1, ...,/q). 

If we define %,..., by 

r < = />!+«■-*(! ~Pl)Vi (i = .(38) 


then the ?/ { = »*(1 ~ Pi)" 1 (u~ p t ) have the same limiting distribution as the.?/). 
Hence the limiting distribution of ^ has the density f(rj v ...,? ?/(i ). 

In exactly the same manner we may prove that for k = 1,..., v the 

9« = «•*(!- pD^in-Pk) (*=*/*!+-+/*»-i+l,...,A4+...+/!») 

* Hsu (1939), p. 256, Theorem 2. 
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have the limiting distribution-represented by the density 

/(?*+.. 

■where, in general, 

= n^ii In n (**-*#) «p -is*? >.( 3 ») 

\i=l / j \ i*«l / 

oo>%>,,. —oo. 

Furthermore, the sets (%, {% l+ i »•••> V*/*)’ ■■■» (V H +. Va) 

are such that the equations corresponding to (27) for two different seta involve 
only mutually uncorrelated u' s, v’a and w’s owing to (ii) of Lemma 2. Therefore 
the limiting distribution must be such that these sets are independent of one 
another. Also, recalling (14) and (i) of Lemma 2, it is seen that the limiting 
distribution of j/j, must be such that the sets fa, (^ 1+1 , • •••, 

(W...+* -i+n •■•>%) and (tj B+t ,....,%) are independent of one another. 

In conclusion we sum up the results in the following theorem: 

Theorem. Let the population canonical correlations he p[, ...,p' p , where 

Px* - A* 

Ppt+l = < • • = P/ii+IH ~ Pt> 

P/i\+...^./iv~ * Ps P 

Pa +1 — ••• — Pi j — 0, 

Pl>P%> ••■>Py>0. 

Let the sample canonical correlations be r lt ...>r p> where 

r i^ r 2^ ^r p . 

Lei % = n*(l-piV(^-Pi) (i = l,...,3J). 

Then the limiting distribution ofifo, ...,y p is represented by the density function 

1. •■■’Vflyi-fti) ‘ • * /(^ /<,+... +Hv~ J+l> •••> Va)fl(V,+V'->Vp)> 
where the functions f and / L are giuen 6y (39) and (18) respectively. 
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THE APPLICATION OE MAXIMUM LIKELIHOOD TO 
DOSAGE-MORTALITY CURVES 

By F. GARWOOD, Ph.D. 


1. Itstboduction 

Many papers have been written on the fitting of dosage-mortality curves; in 
particular, a paper by Irwin & Cheeseman (1939) summarizes the methods 
whioh have been adopted hitherto. It is felt, however, that there is some mathe¬ 
matical interest in the subject which is worth emphasizing, 

A typical problem occurs when studying the effect of some drug on a particular 
kind of animal. It is assumed that there is a population of animals, and associated 
with each individual animal is a certain lethal dose of the drug, such that the 
animal would always be killed by a stronger dose and would survive a weaker one. 
There is independent biological evidence for assuming that the logarithms of the 
lethal doses are normally distributed throughout the population, so that if the 
proportion of animals expected to survive a given dose is converted into a probit 
(i.e. an equivalent normal deviate-!- 5), then the above assumption is equivalent 
to stating that the probits are linearly related to the logs of the doses. If the mean 
(or median) log lethal dose is m and the standard deviation is cr, then the linear 
relation between probit and log lethal dose is 

Y =s <x+/ta, 

where <r = 4 and m = , 

P P 

The experimental material consists of k groups, drawn at random from the 
population, of n 2 ,..., n k animals, which are given doses with logs x u x % ,,.,, x k , 
from whioh there are s v s v .,. ) a k survivors, and % - s v n a - s a , ,v. ,n k -8 k deaths. 

The treatment whioh has hitherto been applied to data of this kind consists of 
obtaining from tables the probits y Xi y v ... corresponding to the proportions of 
survivors q x = «*/% % = s a /n a , plotting the y’ s against the corresponding log 
doses aq.&j,... and fitting a line to the points, bearing in mind the following 
considerations. 

Since dqjdy -~Z, Z being the ordinate of the normal ourve, and as the 
varianoe of q is, PQfn, Q (= 1 — P) being the expected proportion of survivors, it 
follows that the variance of y is PQjnZ 2 which in general varies along the line. 
Thus different weights must be used for the various probits in fitting the line. The 
effects of using different methods of calculating the weighting coefficient 
to = nZ t jPQ (reciprocal of the variance) have been compared by Irwin & 
Cheeseman (1939). 
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A further difficulty occurred in the cases of zero and all survivors, for which 
the corresponding prohits axe infinite. Fisher’s method (Bliss, 1935),- using the 
method of maximum likelihood, overcame this difficulty by replacing the infinite 
probit by a working or fictitious probit in the regression equations. 

Mathematically, Fisher’s exact method of calculation is as follows (see also 
Bliss, 1938 p,nd Fisher & Yates, 1938). Assume rough values a x and b x in the 
relation 7 = a x + b x x. 

For each value of x this formula gives P and Q (the areas of the normal curve 
up to and beyond the probit 7), Z (the ordinate.at 7) and. w = nZ 2 jPQ. The 
regression is then found between the variate 

y = Y+y = Oi+bjX+1), .(1) 

where , - 


and x, giving weights to to the former. The result is a new regression equation 


where 


7 =s a 2 +b 2 x, 

8wx(y-y) 
2 ~ Sw{x -x) 2 ’ 


and 


a a = y-b 2 x. 


It is to be noted that this form of the regression equation is more convenient 
for our purposes than the form 7 = y + b 2 (x—x). Substituting the values of y 
from (1), it follows that 

6a==6l+ and a 2^a 1 +T t ~x{b i ~b 1 ). 

Hence the changes 8a = a 2 ~a x and 8b = b 2 — b 2 in the regression coefficients of 
y = Y+7) on x are in fact the regression coefficients of rj on x, and they can be 


regarded as the solutions of the normal equations 

8a8w + SbSivx = Swij, .(2) 

8a>Swx + SbSvjx 2 = Swxrj. .(3) 


The new regression equation 7 = a 2 +b 2 x is then made the basis of a similar 
calculation; i.e. the regression is calculated between d 2 + b 2 x + r] and x (the values 
of rj will be changed since Q and Z are in general altered), giving another equation 
7 = a 3 + b 3 x and so on. The process is continued until no change occurs in the 
coefficients. 

It is possible that some arithmetical labour might be saved by obtaining 
the corrections 8a, 8b to the regression coefficients a, b at each stage, instead of 
the new coefficients a + 8a, b + 8b, by calculating the regression between tj and x, 
but this has not been investigated. The process of obtaining the corrections is 
illustrated later (Table III). 
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In general the successive coefficients a n and b n will converge to limits (and the 
successive corrections 8a, 8b to zero), and it is not difficult to see that these limits 
are the solutions of the maximum likelihood equations, The process is, in fact, the 
same as the general method suggested by Fisher for solving maximum likelihood 
equations, as will be shown later. Also, a consideration of the foundations of this 
method shows that there is a method of obtaining the maximum likelihood 
estimates which is slightly more rapid (as regards numbers of approximations) 
than that outlined above. 

It is one of the objects of this note to point out that the problem may be 
regarded as one of estimating the parameters m and cr (or equally, a and /?), 
i.e. of fitting a normal curve to the data. The fact that this is equivalent 
to fitting a line to the theoretical relation between probits and log lofchal doses is 
only a consequence of the special nature of the normal distribution. It may happen 
in other applications that the distribution of the log lethal doses is nob normal and 
cannot be normalized by any transformation of the log lethal doses but has 
another form depending on unknown parameters; then the problem can only be 
regarded in general as the estimation of parameters from observations. 

In the case of the normal distribution the method of plotting prohits is of 
course a very convenient method of representing the data and of obtaining a good 
general picture, hut it is to be emphasized that from the theoretical viewpoint it is 
at least equally important to interpret the problem as one of estimating parameters 
as to regard: it as a problem of fitting a regression line in the ordinary sense of the 
term. 


2. General maximum likelihood estimates 


It will be convenient to recapitulate the method used by Fisher for solving 
maximum likelihood equations (see, e.g., Roshal, 1933). A sample x t ,x 2 ,...,* n 
is drawn at random from a population of which the frequency function has a 
known form depending on s unknown parameters 6 V 6 % ,.,,,0 S so that the proba¬ 
bility of obtaining the sample is 


P(*i> x 2 ,..., x n ) d^,d 2 ,...,& s ). 

If the variates % are independent of each other, as in the case of successive samples 
from the same population, then P is the product of n probability functions 
/(aq; 6 X ,6 % , ...),f(x 2 ; d h 6 2 , ,..). On the other hand, they will not be inde¬ 

pendent if they are a set of frequencies with a fixed total. 

The maximum likelihood estimates of the unknown parameters d v 0 2 ,... 
based on the information provided by the sample, are the values which satisfy 
the maximum likelihood equations 3P[dQ 1 = 0, 3 P/3<9 a = 0,.... Using the likeli¬ 
hood function r r/ . „ , , „ 

L — Hj&if # 2 1 • • • J @1) *» « ) “ P J 

the estimates must be solutions of 3Lfdd 1 = 0, etc. 
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Since they are functions of the sample, these solutions can be written 
T x = T x (x x , x 2 ,...), T 2 , etc.; if approximations t x ,t 2 ,... to T x , T 2> ... are found by 
some rough method, suppose that T x — t x + St x , etc., so that St x ,St 2 ,... are the 
errors in the approximations. Then we have 

o=nw.,...) 

+ ^ + 90,30, (*i- 

etc., ignoring terms in St\, etc. Writing • 

{t x , i 2 ,...} = L x> 

a *L 

•••) — Jj xx , etc., 

the equations become iu ^ + L xt St 2 +... = - X 1( etc. 

or = (* .( 5 ) 

*=i 

The solutions of these equations can be written 

Stj = — 2 hi L), (i — 1 , 

i=i 

where, in matrix notation, {l i} } = {Z^} -1 , the reciprocal matrix of {L tj }. Thus if 
A is the determinant formed by the elements L {j , and A i} the determinant obtained 
by omitting row i and column j, then 

A xx t -di 2 -I ^13 / 

Hz — — ~2~> Hs — Hj — ~ j * 

For two variables 

7 -Asa j _ 7 _ L xt , _ L xx . _ T T ra 

hi ~ > Hz — Hi — 2 j~ ’ *** ~ — " 2 j~ ’ a ~ ~ ^i 2 ’ 

and 8t x — (inTi + *-i2 ^2).' ^2 = — H ^22 ^2)* 

The corrections will not be exact, since terms of higher order have been omitted 
from (4); however, if the process is repeated with t x + St v t 2 + 8t 2 ,..., now used as 
the first approximations, the corrections obtained will be of the next order of 
smallness, and the process can be continued until the approximations t are as 
near as desired to the exact estimates T. 

The coefficients are functions of the observations as well as the approxima¬ 
tions t; it is in some practical cases more convenient to replace the a;’s by their 
expected values, i.e. the values which they would be expected to have if d x = t v 

Biometrika xxxu 4 
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d 2 = Let Ay be the resulting values of L {j , so that equations (5) become 

ZAySL - -L { , (t — 1 ,..., s) 

i -1 


with solutions 


8L 


■ ZhyLij, 


(i = 


where {Ay} = 

As before, the corrected, values t + St can be used as the basis of the new calculation 
and the process repeated to any desired accuracy. It is to be expected that the 
replacement of Ly by its expected value Ay will result in the approximation being 
less rapid; this is confirmed in particular examples by calculations given below, 
the difference being small, however. 

The method has been used by Koshal (1933, 1939) in fitting a Pearson Type I 

curve > y-yfc-a,)* (/»-*)*, 


to a set of frequency data. First approximations to the maximum likelihood 
estimates of the four unknown parameters 'a, ft, /q, fi % (or 0 lt /? a , 0%, 0^) were found 
by actually calculating a set of values of L and estimating its maximum position. 
The above method was then applied, In this case, if a typical group frequency is n 
and the expected proportion is p, we have, S denoting summation over the groups, 


L = constant + jSnlog p, 


c n 

S-p { , 

P 


and Ly - — p t pj+ S^py. 

As before, p { denotes dpjdd i and p i} denotes 9*p/c)0 t 3fy, The expected value of the 
last term is 

NSpy = N d Q^Sp = 0, 
where N = Sn = total frequency, so that 


Ay^-NS^i, 

P 

which were the values used by Koshal. 

The oovariance matrix* of the estimates T is approximately equal to {Ay} 
(Fisher, 1922); thus the variance of T t is approximately - A u and the covariance 
of 2\ and T 2 is approximately -- A ia , The degree of approximation is such that 
terms of a higher order in 1 jn are omitted. 

To apply the method to dosage-mortality problems, suppose the probability 
of death is a function P of the dose (or log dose) x and of unknown parameters 

* This has been used by Irwin <fc Cheeseman (1939) to derive the formulae for the 
variances and covariances of the estimates <x and b of the parameters in the lethal dose 
distribution. 
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d v d 2 ,.... The combined probability of the given set of survivors is 

n - - nl P n ~ s Q\ 

s ! (n — s )! 

so that L = const. + $[(n - s) log P + s log Q], 

and L i = S 1 ^^-P i , 

where q = sjn — observed proportion of survivors; therefore 
h, - s [;!«=«) i 5 >+ »p ) i 5 (-A 

„ nPiPj 

and A i} = - 8 . 

If the distribution of lethal doses (or log lethal doses) depends only on para¬ 
meters of scale and location, we have 

P = P(ct + flx) = P(Y)\ 

therefore P x = P'(a +J3x) = Z, and P^ = xZ, 

Where Z is the ordinate of the frequency distribution; thus 

L tt = S^(Q-q), L fi =S~~(Q~q). 

Putting ~(Q~q) = £ = wri and ~ = w, 

we have L a = $£, Lp — Sx£. 

Also L aa = SC, L a p = Sx£', Lpp = 8x%\ 

nZ(Q-q)(Z' Z Z\ a nZ * 
l,e * La * ~ 8 PQ [Z~P + QJ~ 8 PQ 

= -Sw+S/i£, .(6) 

, Z' z z 

where / ‘ == 'Z"'P + Q- 

Similarly L a p = — Svjx + S/ixg, .( 7 ) 

Lfifi = - Swx 2 + Sfix%. .(8) 

The expected value of £ is zero, so that 

Aa = - Sw, A af! = - Swx, A fif , = - Swx 2 . 

The equations for the corrections 8a, 8b to approximations a i ,b i to the 
maximum likelihood estimates, using the “expected” coefficients A, are thus 

8a Sw + 8b Swx = Surr/, 

8a8wx + SJiSwx 7, = Swxy, 

i.e. the same as equations (2) and (3). Thus Fisher’s exact maximum likelihood 
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method of correcting the regression line is exactly equivalent, in the case of any 
distribution defined by parameters of scale and position, to the above method of 
calculating successive approximations to the maximum likelihood solutions. In 
the case of the normal distribution, it should loe noted that if a and 6 are the 
maximum likelihood estimates'of a and ft, then x — (5 — a)Jb and s = 1/6 are the 
maximum likelihood estimates of the mean and standard deviation m and cr, 
since it is easy to see that these values satisfy 3 Ljdm = 0, bLjoa — 0. In other 
words, the problem can equally be regarded as the estimation of the population 
mean and standard deviation from the data. 

Furthermore, it may happen from some peculiarity of the experimental 
material that each experiment consists of one item, i.e. = n 2 = ... = 1, and 
the number of survivors is either 0 or 1. For example, the dose might represent 
some quantity which can be measured but not controlled. 

Provided always there is independent evidence on which to base the assump¬ 
tion of the normal distribution (or of some other known form), there appears to 
be no reason why such data should not be effective for the purpose of drawing 
inferences about the population. 

It is true that for the purpose of testing the hypothesis, say of normality, it 
will be necessary to group the data, and this will also be efficacious in obtaining a 
provisional probit line, i.e. first approximations to a and 6; but for the exact 
estimation of the population parameters this is unnecessary (and would, in fact, 
result in a loss of information), for there is no difficulty in carrying out the 
calculations given above, or illustrated later in Tables II and III, with the values 
of q equal to 0 or 1. On the other hand, the exact problem cannot be regarded as 
one of fitting a line to the plotted probits, since all the latter are infinite in one 
direction or the other. 

Another convenient way of regarding the problem of finding the maximum 
likelihood estimates is a geometrical one. We require to find the values a and 6 of 
a and ft which are such that L a = dL(a, ft)jda and Lp = dL(a, are zero. 
Eegarding a, ft, y as cartesian co-ordinates in three-dimensional space, we require 
to find the point P(d, 6,0) where the two surfaces y = L a , y = and the plane 
y = 0 meet. This is the same as the point of intersection of the curve L x — 0 in 
the plane y = 0 (the horizontal plane) and the curve Lp — 0 in the same plane. 
If (a lt b v 0) is an approximation to P, the tangent plane to the first surface at 
the point vertically above P y cuts the horizontal plane in a line which is near the 
first curve. 

Using co-ordinates 8a, 8b with reference to P 1 as origin, this fine has the equation 


3L(a 1 6 1 ) 

da 


+ 8a 


d*L{aft) x ) 
da 2 


+8b 


3 a P(a 1 6 1 ) 

dadft 


= 0 , 


or in the simpler notation 


L a + 8aL aa + 8bL a p = 0. 
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Similarly there is a line near the second curve with the equation 

Lp + daL a p + SbLpp — 0 

and the intersection P 2 of these lines is a closer approximation to P than P v The 
values of L aa) etc. are given in equations (6), (7) and (8) and the effect of replacing 
them by their expected values A a(X , etc. is to replace the tangent planes by planes 
through the points of contact differing slightly in direction, and to replace the 
above lines by lines whose equations are given by (2) and (3). 


3. The goodness of fit test 


In the regression line treatment of the problem, the test of the hypothesis of 
normality is provided by testing the residual variance y 2 about the regression line, 
with degrees of freedom, two less than the number of groups. From the point of 
view we are considering, it would appear more natural to calculate y 2 as in the 
comparison of observed and expected frequencies, i.e. 




(s-nP) 2 + s (n-s-nQ) 2 


nP 


nQ 


uMQ-q ) 2 nZ 2 (Q-q \ 2 „ . 

- s ~p<r ~ s pq nr) ~ SwV t 

with k —2 degrees of freedom. 

The residual variance about the regression line is 

Sw{rj — x8a — x8b) 2 = Swrj 2 — SaSwq — SbSvixr/, 
so that the two values of y 2 are identical when 8a = 0, 8b = 0, i.e. when the 
maximum likelihood estimates have been approached sufficiently closely. 


4. Comparison of methods of solving maximum likelihood equations 

S 

For convenience we refer to the method using equations £ Ly8t] = — L t as 

j=i 

method I and that using the expected values of the coefficients, viz. 


Ay 8tj — ~ Pi 

as method II. Before comparing them arithmetically, it is of interest to enquire 
whether the two are ever identical, i.e. = A ijt for a “scale-location” distribu¬ 
tion. From (6), (7) and (8) this requires that 

Z' z z A 

11 ~Z P + Q °‘ 


Now Z = dPjdY - P', so that the probability integral P must satisfy the 
differential equation 


p-\ p" 
p "T+ITp 


= 0 . 
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An integral of 
is 


P" + P' 2 /(P) = 0 

P'eimdP = C f > 


so that P' = 0P{1 — P), 

e s+ar 

and hence P = 1 + e B+or * 

Since Y = u+fix, we can choose values B = 0, G — 2 for the arbitrary con¬ 
stants, giving 


and ordinate Z == P' - \ sech 2 Y which is the distribution given by Fisher & 
Yates (1938). Thus for this distribution y = 0 and L ax — A aa , L a p = A afh 
L/w = \ hence equations (2) and (3) will give the most rapid solution. 

Tables for facilitating these .calculations have been given by Fisher & Yates 
(1938); it would appear that Tables XII-XIV of this work supply similar tables 
for fitting a distribution of the type P = sin 2 </> = sin. 2 (a+ fix ); here 
/i — — 2cot2<jJ, w = 4n, £ =in(Q — q) cosec 20, 

so that L aa = - 4 N- 88n(Q -q) cot 20 cosec 20, etc., 

and any advantage in rapidity gained by using method I is almost certainly offset 
by the simplicity in method II, since A xa = -4 N,A afi = - - YNSx' 1 . 

The point has not been tested, since no examples have come to hand in which 
a P = sin 2 0 distribution has been envisaged. 

To apply method I to the normal distribution we have ordinate 


therefore 


Z = 

y2i 7T 

E v z z 

V = 5 - Y +Q-p- 


The two methods of successive approximation have been applied to each of 
the following three sets of data taken more or less at random from those already 
published for illustrative purposes. 

Example (i). Antipneumococcus serum given to five groups of forty mice; 
WilBon Smith’s data (Irwin & Cheeseman, 1939, p. 179). 


Serum dose c.o. 

- 

X 

Deaths out of 40 

0-000625 

-2 

33 

0-00125 

-1 

22 

0-0025 

0 

8 

0-005 

1 

5 

0-01 

2 

2 
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Since the increased dose resulted in more survivors, we call q the proportion of 
deaths. 

Example (ii). Mice injected with Bad. Typhi-murium , sample A; Topley’s 
data (Irwin & Cheeseman, 1939, p, 180). 


Dose (mg.) 

X 

Survivors out of 5 


-3 

4 

0-125 

-2 

3 


-1 

2 

0-5 

0 

0 

1-0 

1 

0 

2-0 

2 

0 

4-0 

3 

0 


Example (iii). Brine shrimps, Artemia salina, in arsenical solutions having 
concentrations in geometrical progression (Fisher & Yates, 1938, p. 5). 


Solution 

X 

Survivors out of 8 

C 

-3 

8 

D 

-2 

8 

E 

-1 

6 

F 

0 

5 

G 

1 

5 

H 

2 

1 

I 

3 

0 


In each case the scale of x has been chosen with a central origin for convenience. 
The last example was used by Fisher & Yates (1938) to illustrate the use of a 
table (Table XI) drawn up for solving these problems; as, however, the arith¬ 
metical work in this note has, for purposes of comparison, been taken to more 
places of decimals than practical work demands, this table has not been used, and 
the requisite areas and ordinates of the normal curve have been taken from tables 
of the normal curve (Pearson, 1930). The results of the comparison are shown in 
Table I. 

Most of the values given are probably exact; there may, however, be an occa¬ 
sional error of one unit in the last place. The first approximation in Example (i) 
is one used by Irwin & Cheeseman; as it has already been calculated from the 
data, the errors in the coefficients are smaller than in Examples (ii) and (iii). 
In Example (ii) the first approximation was found by fitting a line by eye to the 
observed probits, and in Example (iii) the first approximation was found by a 
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Table I. Comparison of methods of successive, approximation 
to solutions of maximum likelihood equations 



Approximation 

Successive corrections to a 

and b 

Example 



Method II 

Method I 


First 

Final 






8a 

8b 

8a 

8b 

(i) 

7 = 5-6461 +0-6689* 

7= 5-6644 + 0-6761® 




0-0071 






0-0001 

in 

F = 7 + * 

7 = 6-6168 + 0-8635* 

HH 


■ I 





gglffjl 


Biff 





l 




IHH 







■ 



O'0002 


— 

— 

(hi) 

7 =4-6 + 0'6a; 

7= +6658 + 0-7128® 

-0-0269 

0-0891 

-0-0268 

0-0984 




— 0-0080 

0-0215 

-0-0072 

0-0142 




— 0'0003 

0-0021 

0-0002 

0-0002 





0-0001 




Method II. Expected values of d 1 L/Set 2 , etc. used. 
Method I. Actual values of c l Lj'da l , etc. used. 


preliminary regression calculation with the coefficients rounded off to one place 
of decimals. The arithmetical details of the calculations (Example (ii)) of the 2nd 
approximation by the two methods are given for illustration in Tables II and III. 

It is seen from a comparison of the results that there is a slight advantage, 
from the point of view of rapidity, in method I. On the other hand, method II 
entails a little less arithmetical work, and if Fisher & Yates’ tables are used it is 
probable that this advantage would be greater, although the point has not been 
investigated, 














Table II. Typical calculation of corrections to estimates of 
parameters, Example {ii), method I 
1st approximation, 7 = 7 + * 


Dose 

K-) 

X 

n 


i 

r-5 



Z* 

— 

Q~q 

Z 

PQ 


zTtQ) ' ,' 

,, y & 

£ =4- ?(ij 

PQ, p 


H 

5 

4 

0-8 

-L 



mm 


1-8126 

0-07493 

-0-2374 . 

-0-45638 

0-125 

SI 

6 

3 

0-6 

0 



039894 

H555531 

1-5956 

-0-35858 


Ei22lil2Si 


SI 

5 

2 

0-4 

1 

0-84134 


EmmI 

-0.24134 

1-8126 


0-2374 



H 

5 

0 

0-0 

2 

0-97725 


0-05399 

0 02275 

2-4285 





i 

5 

Q 

0-0 

3 





3-2874 


0-2785 



2 

5 

11 

OO 

4 

0-99997 





SifiTotH 

0-2200 



3 

5 

0 

0-0 

5 






IhI 




L X = S£=- 0-46228, Lp = Sx£ = 0-63662 
4a = SC' = -1-7629, = B*C = 3-1704, L ff = Sx'g = - 7-2120 


8a = 


44/r~4jw - - 44«-4}-Z'< 


4a 4^ ~4/ 


0.6133, 8b = 4 a /~ = -0-1952 

4a4/i-4/ 


o= 7 


6 = 1 


a+^o = 6-3867 


6 + <W>= 0-8048 


2nd approximation, F = 6-3867+ 0-8048a:. Final approximation, F = 6-6168 + 0-8636*. 


Table III. Typical calculation of corrections to estimates of 
parameters, Example (ii), method II 


1st approximation, r = 7 + x 





D 

n 




2* 

. 

„ Jb? 

■' z 

wt 

u»t 

■n 


5 

4 


-1 

0-15886 

Kfj|| 



0-1708 

Klll| 

|PI| 



5 

‘6 

0-0 

0 

0-50000 

KsjWnfl 

0-39894 

- 0-10000 

- 0-2507 

BT^B 


0-25 

-1 

5 

2 

0-4 

1 

0-84134 

Bo^kvB 

0-24197 

- 0-24134 

- 0-9974 

■ 0-4386 


0-5 

0 

5 

0 

00 

2 

0-97725 

0-02275 

0-05399 

0-02275 

0-4214 

0-1311 

0-0000 

1-0 

1 

5 

0 

0-0 

3 

0-99865 

0-00135 

0-00443 

0-00135 

0-3046 

0-0146 

0-0146 

2-0 

2 

5 

0 

0-0 

4 

0-99997 

0-00003 

0-00013 

0-00003 

0-2369 

0-0006 

0-0012 

4-0 

, 3 

5 

0 

0-0 

5 

1-00000 

0-00000 

0-00000 

0-00000 

0-1928 

0-0000 

0-0000 


SWiy = 0-6366 
-xSwij = -0-8387 
Swi]{!C-x) — -0-3021 


Swtj(x-x) 

Swlz-x)* 


-0-2034 


b = 1 

i + <» = 0-7966 


iSta 8 = 6-9494 = 1-6601 <Sta = -3-0118 

— ajiSwa = 6-4640 5 = -1-8142 

iSiuf*-*) 8 = 1-4864 Swtj - —0-4623 

rj = —0-2785 
-aflb = -0-3690 
8a = -0-6476 
o=7 

o+(5o= 6-3626 


2nd approximation, F = 6-3626 + 0-7966*. Final approximation, F = 6-6168 + 0-8636*. 

* Five significant figures were used for P, Q and Z, where possible, but only five places 
of decimals are shown in the table; similarly in Table III. 

t As n = 6 in each sample, it has been omitted for convenience from £, £' and from 
w, wx in Table III. 
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5. Summary 

The usual practical maximum likelihood, treatment of dosage-mortality 
problems (consisting of the transformation, of percentage surviving into probits, 
adjustment and weighting of the latter, and calculation of successive regression 
lines) is shown to be equivalent to calculating successive corrections to the re¬ 
gression coefficients. The process is exactly equivalent to the method, given else¬ 
where by Fisher, of obtaining the maximum likelihood estimates of the para¬ 
meters defining the distribution. A refinement of this method, using the actual 
values of the second derivatives of the likelihood function, instead of the expected 
values, converges a little more rapidly when applied to the normal distribution, 
but this advantage is offset by some extra arithmetical work. The two methods 
are exactly equivalent only for the distribution specified by P = }sech 2 z. 

The writer is greatly indebted to Mr E. D, van Rest and to Prof. R. A, Fisher 
for much useful help and advice. 
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A NOTE ON FURTHER PROPERTIES OF 
STATISTICAL TESTS 

By E. S. PEARSON 

Dr P. L. Hstj has suggested that I should write a short introductory note on 
the origin of the idea involved in his paper and in that of Dr Simaika’s which 
follows.* In searching some twelve years ago for a systematic method of choosing 
the best test of a statistical hypothesis H a , Prof. Neyman and I came to the 
conclusion that an essential preliminary to any mathematical formulation of 
the problem was the definition of a set of admissible alternative hypotheses, 
C(H). Starting from this viewpoint, our first method of selecting a test involved 
the use of the likelihood ratio, but, however useful as a practical method of 
attack, the principle underlying this approach was somewhat arbitrary. A more 
fundamental procedure, later developed, was to choose a test paying regard 
to its power function, that is to say, to the chance that its use would lead to the 
rejection of H 0 if an alternative H^H 0 of C(H) were true. It then appeared 
that a number of statistical tests in common use had the remarkable property 
that they maximized this chance for every alternative to H {) in G(H). Such 
tests were termed uniformly most powerful tests of H 0 with regard to C(H). 

That there were limitations to the situations in which a uniformly most 
powerful test could exist soon, however, became clear. These limitations were 
gradually explored, and the following papers are further contributions to the 
subject. It was found that these tests generally, though not always, con- 
berned the value of a single parameter. Such are tests of the hypothesis that 
a mean or a standard deviation has a specified value, or that the difference 
between two means or two standard deviations is zero. Further, in these cases 
the class of alternatives must be restricted; thus the two-sample i-test of the 
hypothesis that two population means and £ z are equal, is only uniformly 
most powerful for the situation in which the alternatives considered are defined 
by — £ 2 > 0 or by — £ 2 < 0 but not for both at the same time. 

In this connexion, in 1935, Kolodziejczyk was able to prove that for tests 
of a linear hypothesis, no uniformly most powerful test could exist if the 
number of parameters involved was greater than unity. This result was im¬ 
portant, since the majority of tests used in the analysis of variance can be 
reduced to tests of a linear hypothesis. 

This limitation of tests regarding the value of two or more parameters can 
be illustrated by a geometric presentation. Since, the most important features 
of the problem can be illustrated when H 0 is a simple hypothesis concerning 
the value of two parameters, I shall take this case, using notation already 
adopted in this connexion. 

* See pp. 62-69 and pp. 70-80 below. 
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Further properties of statistical tests 

Suppose that the elementary probability law of random variables %,..., x n , 
whose particular values are given by observation, is of form 

p{x v ...,x n \0 x ,d 2 ) = p(E\0 v 0 i ) t ( 1 ) 

6 lt 0 2 being the two population parameters. For a critical region w of size a 
associated with a given test, we may write 

P{Eew j e v e 2 } =J*... J p{E [ $ v 0 2 ) dx 1 .„ dx n = fi(d v d 2 \w), (2) 

If the hypothesis H 0 which w has been selected to test assumes that 

0! = 0?, = (3) 

then | w) = a, (4) 

where a is the significance level chosen. 

A power surface may be obtained by taking rectangular axes for 6 X and 
in a horizontal plane and plotting fi{8 v 0 2 1 w) as a vertical ordinate. If w 0 were 
a critical region associated with a uniformly most powerful test of II 0 , then -its 
power surface would fall nowhere below the surfaces derived from other critical 
regions satisfying (4). No unique surface with this property will, however, in 
general exist. If, for instance, we choose w 0 so that the surface will rise quickly 
in the direction parallel to the axis of 6 V we shall reduce the rate of increase in 
the direction of 0 2 , and vice versa. Power surfaces of alternative critical regions 
may, in fact, cross one another in a complicated way, but no single surface can 
everywhere lie above all others. If we confine attention to tests for which the 
power surface has a minimum ordinate of a at the point 6 §, i.e. to unbiased 

tests of H 0 , we shall still be unable to find a uniformly most powerful test in 
this restricted field. 

The difficulty in choice between alternative tests can, indeed, only be solved 
by a further formulation of the requirements of a satisfactory test. Several 
lines of attack are open: 

(i) To lay down conditions for the form of the power surface in the neigh¬ 
bourhood of the point $?, 6\. Here we may describe the objective as to make as 
large as possible the chance of detecting small departures in d x and 0 a from the 
values specified by H 0 . A method of approaching the problem from this point of 
view leads to the development of the unbiased test of Type 0 (Neyman & 
Pearson, 1938). 

(ii) To regard it as of more importance to control the form of the power 
surface at some distance from its minimum point; for example, to try to select 
a critical region for which the power surface reaches the level 

0{OvO*\w) = 0-95, (5) 

along a contour lying inside the corresponding contour associated with any 
other test. This method of approach has been examined by Dr B. L. Welch, 



E. S. Pearson 61 

but his results are not yet published. It is possible that methods (i) and (ii) will 
lead to the same result. 

(iii) To consider whether from the practical point of view, if H 0 is not true, 
the importance of the departure of the unknown parameters from 0\. Q\ can be 
measured by a single parameter, 

A=/(0iA). (6) 

If this is so, we are in fact defining a system of contours on the 6 X> 0 2 plane 
along any ofte of which we should like the ordinates of the power surface to be 
constant. Such a system would be defined, for instance, by 

A 2 = (tfi-^) 2 +(0 2 -tfg) 2 , (?) 

and if /3(6 V d 2 1 w) is to be constant for values of 9 X , d 2 satisfying (7), the contours 
of the power surface will be circles of radius A. Alternative tests would then be 
confined to those whose power surfaces had circular contours, would be the 
hypothesis that A = 0 and the uniformly most powerful test, if it exists, would 
be that for which y?(A j w 0 ) > ^(A | w) ' (8) 

for A > 0 and all alternative critical regions w satisfying the conditions stipulated. 

The problem thus presented in the case of a simple hypothesis concerning 
two parameters will arise in similar form when H 0 is composite and concerns the 
value of many parameters 6 V 0 2 , ..., 6 C . In a number of multivariate problems 
we have reached a position in which: 

(а) tests of statistical hypotheses concerning the values of several population 
parameters have been derived, as well as their power functions; 

(б) these power functions have been shown to depend on the value of a 

single function A = f(6 v 0 2 ,...,fig 

of the parameters considered. 

In the following contributions Dr Hsu and Dr Simaika have examined three 
of these tests, that concerned with the general linear hypothesis, with Hotel¬ 
ling’s generalized T 2 and with the multiple correlation coefficient. They have 
shown that of tests whose power function depends only on a certain function A 
of the population parameters, the existing tests are the uniformly most powerful. 
It is of course true that in the problems in question no alternative tests are. at 
present available or indeed likely to become so. Nevertheless, I believe that the 
discovery, resulting from Dr Hsu’s initiative, of the relationship between the 
test function and a corresponding comprehensive collective character in the 
population, has taken us a step farther in our understanding of the properties 
of statistical tests. Further, this relationship between A’ 2 and A, J 12 and 
D 2 and A 2 , R 2 and p 2 seems'to lead us round by another route to the problem of 
Statistical estimation. 
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ANALYSIS OF VARIANCE FROM THE POWER FUNCTION 

STANDPOINT 

By P. L. HSU 


A li’R'ESH study on the classical analysis of variance tests in the light of the 
Neyman-Pearson theory was started by Kolodziejczyk (1935), who formulated 
the class of linear hypotheses for which these tests may be employed. As a linear 
hypothesis is defined relative -to the set of admissible hypotheses, the study of 
the 2J 2 -test (by which we denote any test falling under the usual methods of 
analysis of variance) may be made with reference to its power function. P. C. Tang 
(1938) showed how the power function was related to R. A. Fisher’s ( G) distribu¬ 
tion (Fisher, 1928) and so was able to appraise the chance of detecting the falsehood 
of a linear hypothesis using the A 2 -test. The great theoretical value of the power 
function lies, however, in its use in comparing the relative merits of alternative 
tests of the same hypothesis. In this paper we shall prove a theorem (p. 63) 
which asserts that out of a certain class of tests the E 2 -test is uniformly most 
powerful. 

In his paper Tang has used an orthogonal transformation in the sample space 
which enabled the general linear hypothesis to be reduced to the following simple 
form: Given the elementary probability law 

Ptyu ••• ,y m > h>-~X) = WW 0 -}-< m+,l) exp j-^r 2 Qg iVi- + ^*?)), (1) 

where all real values of i) v and all positive values of cr are admissible, the 
hypothesis is that of the rfs have the true value 0: 

Vi = = 0 - ( 2 ) 

We call the above hypothesis H 0 . 

We shall set , /1h 

i=i / U-i - 


H'' 


(3) 


and call w 0 
inequality 


(of size e) the critical region for the rejection of H 0 defined by the 

(4) 


where E J is a constant so determined that the probability that (4) is true, given 
that (2) is true, equals e. 

The power function of w 0 as given by Tang can be written 



-*S{fc!-B(K+M«)}- 1 A A f 1 

ft-0 JE, 


(,®2)i«i+ft-i (.1 - E 2 f n ~ 1 d(E 2 ), ■ 


(5) 
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wher e A = ~ (6) 

An outstanding feature of the power function '(5) is that it depends on the single 
parameter A. Our problem is, does there exist another critical region of size e 
whose power function depends on A alone and which is more powerful than w n for 
certain values of A ? The answer is contained in the following theorem and is in 
the negative. 

Theorem. Suppose that the critical region w satisfies the following conditions: 

(а) w is of size e, 

(б) the power function of w depends on the single parameter A. 

Let /?(A) be the power function of w and /? 0 (A) be the power function (5) of w 0 . Then 

AA)</?o(A) (7) 

for all positive values of A. 

Proof. In the place of z v ...,z n we substitute spherical co-ordinates, viz. the 
radius vector r = (Az 2 )* and n -1 angles, 6 X ,..., 6 n _ v We deduce from (1) that 

p(y i. 0i. - A— 1 > r ) = p(y» y n f) p{y m+i>->y m )p( 0 i >- A-i )p( r )> 

( 8 ) 

where p(y v .... y nj ) = {f(2n) cr}-«iexp J - ~ ^) 2 }. (9) 

p(yn l+ v->y m ) = £ iVi-Vi) 2 ], ( 10 ) 

1 i=»n t +l I 

p(r) — 2 -i<n ~ 2 ) r n_1 exp | , (11) 

and jp(0i,..., d n _f) is the well-known product of cosines which involves none of 
the parameters r\^...,rj m and cr. 

We now make the following successive transformations: 

n t 

r = sf s = t-Zyl yi^t% (i=l,...,%), (12) 

1= 1 

and also write % = cr -1 ?^ (i = 1,...,%). (13) 

It follows that 

jPAi+I. ■••’Vm) 01> M l> 0 

= P(y ni +i>~->y m )p(6v;9 n -i)p( u v ■:■.«*!» *)» (I 4 ) 

where 

3)(« x , i) = (^2tr)-("-+%) TT-* n {r\n)- 1 e~ K exp|j 

/ \ l ( n - 2 ) / If n , \ 

x(l-M) exp^gy^y (15) 
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From now on we shall write y for the set of variables «/ n +1) .. y m and dy for 
dy nl+1> • and use similar abbreviations 6, u, dd and du, 

"Suppose now that the critical region w satisfies the conditions (a) and ( b). 
Let F(y, 0, u, t) be the characteristic function of w, i.e. r(y,6,u,t) = 1 or 0 ac¬ 
cording as the sample point falls within w or not. Let W be the sample space. 
Then the power funotion of w is 


/?(A) = I r(y, 0,u,t)p(y, 6, u,t) dydOdudt, 

Jw 


(16) 


whence, 


1-2) 

V 4 ( 71 — 2 ) 


cr)- (,i+,l i) J I\y, 6, u, t) p(y) p{6) t i(n+n i~ 

x exp^ — ^1 —S dydOdudt — rr in >-F(^n)e, (17) 

(^2<r) _<B+ "-i ) J F(y, 6, u, t) p(y) p{6) <«»+%- 2 ) 

/ t \ 1 n > \K»-2) ■ / Jt \ 

x exp I - 2 ^j - 2 exp I2 7<% I dydOdudt 

= r{\n) e*/?(A) = F{ A) = F ^2y?j, say. (18) 

Let W± be the sample space of 0, u and t, and put 

(J2 tr)-< n + n i) J F(y, 6, u, t) p(0) 2 > 

x exp I - — 2 J ^ 1 - 2><j exp ^ ,Zyi u iJ dOdudt 

— F(h) = <f>(y,y, or). (19) 


Then, by (18), f ... f y)p(y) dy = 0, 

J — 00 J — CO 

J’GO/'CO / 1 m \ 

i-e. J_ oo -J m 4 > (yn 1 +v--->y m ,yi,---,y ni > (T ) QX v[-2^.Jj + yiJ 


( 20 ) 


xexp( 2 oc iy A dy ni+l ...dy m = 0, (21) 

\i=n x +l / 


on writing for (2 <t s )' -1 r/ i {i=n l + 1,..., m).. 

Equation (20) must hold true for all real values of the a’s. Hence it follows 
from the well-known theorem on Laplace transformation* that 


-/ i m \ 

0(y,y,a-)exp| —= 0, 


* Of. Doetsch (1937), p. 36, Theorem 1. Though the theorem referred to is stated for the case 
where the number of y 's is one, it may easily be extended to the ease of more than one y by 
induction. 
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i.e, <f>(y,y,(r).= 0, whence, by (19), 

(V2cr)~(”+%> f r(y, 6, u, t) p{6) (Kn+^j-2) 

J W\ 

x exp ( "20-2) (* “ >exp A r<M< ) = ^ (.^ 7 ?) • ( 22 ) 

In particular, from <^(y, 0, <r) = 0 and (17) we have 
C^/2 cr)-< Il +%) f r(j/, 0, m, 4) p(0) tf*< m+n i-2) 

Jiv, 

( t \ / »i \W»-2) - 

""2a 5 ) ( 1 “ S w ?) dddudt = n^F^n) e. (23) 

Letting li^ be the sample space of 0 and «, we get respectively from (23) and 
(22) that 

roo 

(*J2 a)-( n + n i> \ t m+n ±~ i) 

xexp^-^jdtj^r(y,d,u,t)p(d)^l-puij dOdu = l'{\n) e, (24) 

(V2<r)“< n + n i)j. o <K n + n i- 2 )exp^—.T^, 0, %,«) p(0) (^ - 2>?j 

x exp (£!, 7 iM<) d0dtt = F y?) • (25) 

Hence, on developing the left-hand side of (25) into power series in the y’s, we 
must have 


j; 




( 2<r*) 


<2< 


x J P(y, 0,it, £) p{6) ^1— £ u? j £ y< % j dddu = 0 for odd A,. (26) 


. a— 1 '^ 2 /i.) j ^(71+711—2)+^ 

Jo 


/ f \ /* / n, \Hn- 2 ) / n t \2 ft 

x exp - -^ 2 ) rfi! J ^ jT(y, 0, m, <) p(0) j i - £ w? j ^ 17i%j d0dfo 

= %(.| i r?) A (^=1,2,3,...). (27) 

Further, equations (24) and (27) may be written as 




^i(n+7i 1 ~2) exp 


( 2 < 7 *) 


- 0 , (28) 


Biometrika xxxn 
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j Q (K«+ni“2) exp ( - —j j w r(y, $, u, t) P(d) (1 - M) 

... .* 1 
*(,?/<“<) ^- 2trft(:+^I} ] d, = l) (29) 

Equations (28), (26) and (29) must hold true for all positive values of o'. 
Hence, by the theorem of Laplace transformation,* the functions within the 
square brackets in (28) and (29) and the inner integral in .(26) must vanish 
identically: 

j w , ^ »■«•« fW ( 1 *■ 


(30) 


C / «i \Kn-2) ( n t \h 

I\y , 0,i4, t) p(0) (1 - E ttf I ( E j <20dit = 0 for odd A, 

Jw, \ 1=1 / \i=l / 

C / n, \i(m-2)/ th \2?i 

jT(l/, 0,14, f) p(0) 1 - 2 (SriMi d9d ^ 

JW, \ i= 1 / \l=l / 


(31) 




(M 


(h= 1,2,3,...). (32) 


” 2 /l L{^(?i + M. i ) + /i) 

From (31) and (32) we infer that 

J ^ r(y, 0,14, t) p(0) (l - M) exp (Eh®,) d0dw = G 7?) • (33) 

Now for any given values of y and t the integral F[y, 0, u,t)f(8, u) d6du 

Jw, 

equals /(0, u) d&du, where w 2 is the set of points in the sample space of 0 and u 
for which r(y, 0, u, t) = 1. Hence (30) and (33) are equivalent to 

r l »i \K»-S) Tj-ini rdn) 

dedu = TWT^f' < 34 > 

j w m (l - E_14?J exp [ ZnUij dddu = G r?j. (36) 

The conditions (34) and (36) are necessary and sufficient that the critical region 
w should have the properties (a) and ( b ). 

On the other hand, from (12) we have 


E* = s 

i=i 


hence u> 0 is the region defined by the inequality 


2 u\ > E\. 

i-i 

* Cf. footnote, p. 64. 


(36) 

(37) 
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Since w 0 is of size e, we must get the same right-hand side of (34) when in the left- 
hand side we substitute w 0 for w 2 . Hence 

r l rn \ i(n—2) r / nx \i( n-2) 

p{d) 11 — L «!) dddu=\ j>(0)(l-2«5) dddu. (38) 
J w, \ i=1 / Jw 0 \ ^=l / 

r l n, \ i(n—2) /Tlx \ ( n, \ 

Let J p(6) (1 - exp ( 2 dddu - G 0 1 £ 7? j • ( 39 ) 

With the help of (37) and (38) we may now appeal to the lemma proved in the 
Appendix and conclude that 

G(AKflWA), ( 4 °) 

whence, replacing y i by <r- 1 lji.y i in the integrals in (35) and (39), 
j w 'P( 9 )( l ~ 2>j)^ ) exp _£ 7i u ij d6du 

r / rh \i(n—2) ft* nj \ 

p(d)\l- exp £ y*Wf j dddu. . (41) 

If we multiply both sides of (41) by 

(j r ^iMta * (-i) • 

and integrate over the sample space of y and t, we get, in accordance with (18), 

/?(A)</? 0 (A). 

Hence the theorem is proved. 

APPENDIX 

Lemma. Let g(x) > 0 be defined for x^O and vanish for x>\, stick that 
g{v\ + ... + 1 >|) is hummable. Let f(w-y, ...,w m ) > 0 be summable. In the product 
space of the v’s and w’s let Rbea region such that 

| /(«h, • • •. w m) d( v l + • • • + v i ) ex P (7i v i + • • • + y n v J dvdw = G{y\ + ... 4- yl). (1) 
JR 

Let w 0 be the region defined by the inequality 

v\+ ...+v\^k. (2) 

Let I /K.w m )^(w?+...+<)exp(y^H-. ..+y n v n )dvdw = G 0 {y{ + ... + y£). 

(3)* 

Finally, let 

f(w v ... > w m )g{v\ + ... + vl)dvdw = /(%, ...,v> m )g{v\+... + v 2 n )dvdw. (4) 

JR JR, 


Then G(x) ^ G 0 (x) 

for all positive x. 

* Notice that (3) is not a separate condition on R 0 , but is implied by (2). 


(5) 


5-2 
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Proof. In (1) we set y 1 = x,y i - ... = y n - 0, and get 

Q(x 2 )= f{w lt g(v\ + ...+«*)exp (xv t )dvdw. 

JR 

This and the conditions on / and g imply that 0 is continuous. 

Multiplying both sides of (1) by exp (- Ey\) and integrating over the region 
we get 


K G{x) dx 

J ft 

= f{iv 1 ,...,w m )g(Zvl)dvdw ( 

J It 


exp (-Ay? +Iy i v i )dy, (6) 
J 

where K is some numerical constant. Applying a rotation in the space of the y’s 
to the inner integral in the right-hand side of (6), we obtain 


K \ b x^-Ve,-*G{x)dx 
J a 


= f f{w l ,...,w m )q(Ev\)dvdw | 
Jr 


J 


exp {- Ex\ + (Sv\f x t } dx 


— S C lJh(R)> 
h** o 


where I h {R) = j^/K, •••> w m)^ v D h dvdw, 


C ‘-J 




x\ h exp (- Ex\) dx. 


Similarly, we have 


K i b x^-*e-*G 0 (x)dx = £ c h I h {R 0 ). 

Ja A=0 


An appeal to. a general lemma of Neyman and Pearson,* on remembering 
(4), leads to the inequality 

I h (RHIM. 


Hence 


j: 


ajKn-a) e~ x {G(x) — G 0 (x)} dx ^ 0. 


Since a and b are arbitrary and since the integrand is a continuous function, the 
latter must be < 0. Hence G(x) 4 G 0 (x). 


* Neyman and Pearson (1936), p. 11. 
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ON AN OPTIMUM PROPERTY OF TWO IMPORTANT 
STATISTICAL TESTS 

By J. B. SIMAIKA, Ph.D. 


P. L. Hsu (1940) has shown that for any linear hypothesis the f£ 2 -test is the 
uniformly most powerful of all the tests whose power function depends on a 
certain function, A, of the population parameters. Two other tests of importance, 
namely, those associated with the multipip correlation coefficient and Hotelling’s 
T 2 (Hote llin g, 1931), have the similar property of being uniformly more powerful 
than all other tests whose power functions depend on the respective functions 
of population parameters involved in the distributions of R 2 and T 2 . It is the 
purpose of this paper to establish such an optimum property of these two tests. 
We shall consider them separately. 


I. Hotelling’s T 2 

The general problem that calls for the T 2 -test may be stated in the following 
way: given the elementary probability law 

p{yi,-,y«>hi> a w-> 8 QQ) 


( Q Q 

=-KKl^exp —| S , S 

( a ij — a ji> % = 


(1) 


it is required to test the hypothesis that 

Vi= 0 (2) 

Hotelling’s test consists in calculating 

T 2 = S sV Vi y p (3) 

t,l=i 

where s ij denotes the general element in the matrix ||s y || -1 , and rejecting the 
hypothesis if T 2 >T 2 e , (4) 

where T 2 is a constant so determined that the risk of rejecting the hypothesis 
when it is true equals e. 

The distribution of T 2 derived from (lj, which conforms with Fisher’s (C) 
distribution (Fisher, 1928), was obtained independently by Hsu (1938) and Bose 
& Roy (1938), and may be written 

p(T 2 \y,a) = p(T 2 \f 2 ) 

^yi2pa+7t-i^ (g) 


= e -*» j {r) " 


^=0 h\ B(^q + h,fym) 

a 

'! f2 = i £ 

i,i=i 


where 


( 6 ) 
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Hence the power function of the T 2 -test is 

p(T 2 \r)d(T 2 ), 


r, 


(7) 


which depends only on the function r of the if s and a’s. Our first theorem asserts 
that the T 2 -test is uniformly more powerful than any other test whose power 
function is a function of r alone. 

Theorem I. Let w 0 (of size e) be the critical region defined by the inequality (4), 
and w be any other critical region whose size is e and whose power function is a 
function offr 2 . Let fi(fi 2 ) and /? 0 (fr 2 ) be the power functions ofw and w 0 respectively. 

Pirn Mry w 

Proof. Let us first find a necessary and sufficient condition that w should 
have the properties described in Theorem I. We have, from (1), 


p(y, s) = Ke~t* | sy \^ exp - \ 2 %(% + y^f) 4- 2 OqW, 

w i,j= 1 


U. 


ij 


hj+ViVi (*,j-1, 


£i = 2 KijVj (i=l,...,q), 

j=i 


l-i 


Hence, on setting 
and 

we have p(y,u) = Ker** | u ii -y i y i p-^exp (2 a«%+ 2 

\ i,j- X 1=1 / 

and r = l 2 oSKfQ, 

1 , 1=1 

where a# denotes the general element of the matrix || a it 

If w is of size e and has a power function depending only on then 

K \ l%-«/i^| im ‘ Tl exp(2 ««%)dydu = e 
Jw \ 1,1=1 / 

and 

k[ 2 2 dydu = e'P/](fi 2 ) = F(f 2 ), say. 

(15) 

It follows from (15) that, on expanding the left-hand side into a power series in 
the £’s, we must have 

f Ki- 2 /i 2 /i l im-1 exp(-£ 2 <%%) f 2 dydu = 0 for odd h, (16) 

Jw \ 1,1=1 / \i= 1 / 


(9) 

( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 


m 


ij« 


fhj-yiV) 


lim-l 


/ a \ / a \2ft 

exp I - ^ 2 1 a-v-uy) I 2 j dy du 


= lMrf (A = 1,2,3,...), (17) 


where the a /t are numbers depending only on the region w chosen. 
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On the other hand, since the integral of (12) over the sample space W is unity, 
we have 


K \ \ u a-yiyi\ im ~ lex v[-i S £ £i2/t) dydu = e^, 

JW V i,j=l i==l / 

(18) 

whence 

K \w 1 ^ ~ y<y ^ im l eXP ( ~ ^ i t “«%) dydU = lj 

(19) 

miw 1 "*-*« l ‘”' 1 “ p (~ l J. “«“«) . 



4(f)‘ (A= 1,2,3,...). 

(20) 

Combining equations (14) and (19), (17) and (20), we obtain 


J 

to \ i,i=l / 



= e | u ij ~y i y i | lm - x exp ( - ^ dydw, 

(21) 

J 

[ u I % - ViV) I 1 ”* -1 exp f - ^ (J] dydu 

= | im_1 exp^-^ ( { 2£i2/t) dydu 



(h = 1 , 2 , 3 ,...). ( 22 ) 


The sample space W is the product space W(u) x W(y \ u), where W(u) is the 
sample space of the u’s and W{y\u) is formed of the possible positions of the 
point (y v ...,y Q ) for given values of the u’b. Similarly w = W (u) x w(y \ u). If we 
evaluate the integrals in (21), (16) and (22) as repeated integrals, we obtain 

[ exp(-| i ccyuJ 

J wm \ i / 

X r f | | im_1 dy-e\ \u ij -y i y j \ * ro - 1 dy 1 du = 0, (23) 

LJuj(y\u) JlV(,v\u) J 

f exp [" f | % ~ ViVj l iw_1 ( £ dy~\ du = 0 for odd h, 

J W(u) \ i=l / L J to(l/|u) Vi" 1 / J 

(24) 

I «f “IS I I I % - ViV) l 4m_1 ( £ £iVi) dy - a h 

J W(u) \ i,j=l \l=l / 


u, 


W(V N 


y-Z/t2/i 


I Jm-l 


( 2 di/"| du = 0 


'J 


(A = 1,2,3,...). (25) 


Since equations (23), (24) and (25) must hold true for all admissible sets of values 
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of the a { ,j, so, according to the lemma proved in the Appendix, the functions 
within the square brackets in these equations must vanish identically. Hence 


/. 


w{y\u) 


ViVi | im-1 dy = e f | % - ViVj | 4w_1 dy, (26) 

J W(v\u) 

f I Uij-ViVj | im_1 ( 2 £iVi) dy = 0 for odd h, (27), 

f I u a ~ ViVi l im_1 ( 2 £<!/<) h dy = a h f | u i} -I*” 1 " 1 ( £ dy 

J w(v/u) U=1 1 J W(vlu) \1—1 / 

(k = 1,2,3,...). (28) 

In order to simplify the above equations we notice that the matrix ||«y ||, 
being positive definite, can be thrown into the form CC', where C is a non¬ 
singular real matrix. Using the transformation 

I! Vi> • ••>2/g'll = II *i> •••> x a II ^ > 


we get that 


(29) 


r / a Um—i r / a \jm-i 

l-2z?) dx = e\ 1-S*! dx > (30) 

J «|(£B|«) \ 1=1 / J W(x\u) \ 1=1 / 

r / 8 \im— 1 / a \h 

(1 - £ «i j £ £<0^1 dx - 0 for odd h, (31) 

J to(a|«) \ 1=1 / \i=l / 

r 1 8 \im-l / « \ 2 h 

i~2*l (■£&*< dU 

J tu(*|M) \ 1=1 / \l=l / 

/• / a \im-l / a \8fc 

= 1-S^l £fi*i dx (h= 1,2,3,...), (32) 

1=1 / \l=l / 

where || ...,| 9 || = )!^,..., ^||G. (33) 

Now W{x | u) is the region 8 (independent of the u’s) defined by the inequality 

2*?«1. (34) 

i— 1 

Hence the integral in the right-hand side of (30) is a numerical constant, say b, 
and a rotation in the space of the x’s enables the integral in the right-hand side 
of (32) to be written as 

/a \ft r / a \im-i 

g2£?J ^l-JJ*?] dx. (35) 

Hence we obtain the following equivalents of equations (30), (31) and (32): 

r I a \im-i 

1- S®! dx = be, (36) 

Jiu(a|w)\ 1=1 / 

/ a \im—l/ a 

II — S as® I ( £ £i x i I dx — 0 for odd h, (37) 

J \ 1=1 / \i=l / 

f / a \ im—l/a \ 2ft /a \ft 

1-2*? ££i*i <&=M2$ (A =1,2,3,...), (38) 

J \ 1=1 / \i=l / \i=l / 

where the b k are numbers depending only on the choice of the region w. 
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The set of equations (36), (37) and (38) give the necessary and sufficient 
condition that the critical region w should have the properties described in 
Theorem I. Further, equations (37) and (38) may be combined into the 
following: 

C J a Wm-i /a \ / s \ 

l-2>? exp SQ). (39) 

J «!(*!«) \ i= 1 / \i=l / \l=l / 

Now according to (29) we have 

S*5- S u«y i y j = T*l(l + T*) ) (40) 

i=i i,i=l 

where denotes the general element of the matrix || ||. Hence w 0 is the region 

defined by the inequality 

SateZ*/(l + 2»). (41) 

i=l 


Since w 0 is of size e, we must have the same equation as (36) when w is replaced 
by to 0 therein. Hence 

Cl 9 \im—1 Cl « \im-l 

i-2*? dx = (42) 

Juj(x|m)\ i=l / J Wo\ i=l / 

/* / a \ 4 m—l /a \ / e \ 

Letting !-.£*? exp 2 dx = { £ i\ , (43) 

J w,\ i-1 / \i=l / \i—1 / 

we deduce with the help of (41), (42) and the lemma proved by P. L. Hsu in the 
Appendix of hia paper (1940) that 


0 




(44) 


Applying the transformation reciprocal to (29) to the integrals in (39) and (43), 
we get 

I* I Uij-lkVi |* m-1 exp ( S &%) dy^ f | ViVj | im_1 exp ( £ dy. 

Jw(v|u) \i=l / J W,{y\u) \i= 1 / 

(46) 

Hence on multiplying both sides of (45) by K exp (- exp ( — \ ) and 

integrating over W(u) and remembering (16), we have the inequality (8): 

/# 2 H/?o(n 


which was to be proved. 
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II. Multiple Correlation Coefficient 
In this connection the basic elementary probability law is taken to be 


= K(l-p*)* n 


3 Vi 
Vi *11 


Vq 


Xn-q-Z} 


X- 


10 


Vq *gl 


X, 


'00 


x ex P (— \j z ~ SAyi-i £ I (!) 

\ i=i «-i / 

( x ij ~ x ji> x ij ~ X ji)> 

where p 2 = i £ a** A A (2) 

/ i,i =1 

is the square of the multiple correlation coefficient of the population. We have 


^ Hi 
Vi *11 


Vq 


U l0 


Vq 't'ql 


"QQ 


•”]*«]{*- l^&VtVi) ( 3 ) 


and that the square of the multiple correlation coefficient of the sample is 

-K 2 = - S aPytyp 

z i.I-l 


(4) 


The hypothesis to be tested is that 

A =o (5) 

Theorem II. The basic elementary probability law and the hypothesis under test 
being given by (1) and (5), let w 0 be the critical region of size e defined by the inequality 

( 6 ) 

and w be any other critical region whose size is e and whose power function depends 
only on p 2 . Let /?(P 2 ) and /J 0 (P 2 ) be the power functions of w and w 0 respectively. 

Thm AP*)«A(P*)- (?) 

Proof. Suppose that w has the properties described in Theorem II. Then 

\l(n- 0 —2) 


\i(n—0-2) 


( 8 ) 


r / a \i(n-q- 2 ) / a \ 

K jj%| ^-q-V \ z ~ ViViJ exp - \yz - i s a-u dzdydx = <?, 

if J I 1 4(m -^ 2) ( 2 ~, 2 rf* Vi y)j 

x exp ( - 4yz-iS - 2 PiVi) dzdydx 

= i l -P 2 )~ in P{p 2 ) = F(P 2 ), say. (9) 
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Hence, on developing the left-hand side of (9) into a power series in the /?’s, 
we have 

f / a \J(m—a— 2 ). 

|J(n-4-2)| 2 _ £ 

Jto \ ! 

x exp ( - £yz - £ £ ) dzdydx = 0 for odd A, (10) 

\ 1 \i»x / 


JLf 


\K»-a-2) 


% p-«~ 2) (z-^ p & Viv)j 

x exp | - \yz - i p i) dzdydx 


~T\T<M ah{p2)h ( A = W,-.), (H) 

where the a ft are numbers depending only on the choice of the region w. 

On the other hand, since the integral of (1) over the whole sample space If 
is unity, we have 

r /a \i<n-a-*) 

K ) W \ X V \f ~ i 2 &ViVij 


whence 


x exp ( - \yz -£ £ a i} x {j - £ dadyd* = (1 -p*)~' n , (12) 

\ i.j-i i-i / 

C /a \l(n-a-a> / a \ 

-K J ^ Ki I 1<n - 4-2) ViH y<2/ 7 exp I - lyz - £ 'p^ct {j x i} J dzdydx - 1, 

(*“^2 *0VtVi) 


K 
(2 h) 




Kn-a- 2 ) j 




(13) 


/ 8 \ / a \2A 

xexp^-lyz-^S^^J^ dzdydx 

r{\n+h) 


(P a ) ft (A - 1,2,3,...). (14) 


h!.F(£n) 

Combining equations (8) and (13), (II) and (14), we obtain 

J w I »# l to_4_2) S ^ iW/j 

xexp(-iyz-| £ a y * w )dzdj/d!a; = e f (...)dzdydx, (15 
\ <■ i-i / Jw 

f , ... / « .. \«8V-fl-2) 

J J *« | ^ f £ 1 **' Vi Vi j 

/ a \ / a \ 2 ft 

x exp j — |yz- ^ £ | S Ay< 1 

= a ftj (■■■} dzdydx (h = 1,2,3,...), (16) 
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where the unwritten integrands in the right-hand sides are the same aB those 
in the left-hand sides. 

As before we argue that W = W(z, x) x W{y\ z, x), w - W(z, x) x w(y | z, x) and 
evaluate the integrals in (15), (10) and (16) as repeated integrals. It follows that 


f 

IV (z, x) 

Xij 

2) exp | - 

lr ‘-*<&“»*«) [j 

/ « 
2- S 

U>(l/|z, x) \ vi= 

x^ViVi 


2) 

dy 




r i 

Q. 


i(n-3-2) ' 



(17) 



— 

5 Iz- 

2 


dzdx = 

0, 




J w(vlz,x) \ 







L^.as) 

I* 01 ” 3- 

2) exp (- 

a 

S a 







\ 

i,i =1 







r c 

/ a 


■2)/ a 

\ft 

"1 





_ J w(y\z, x) 

2- S 

\ i.j = l 

wyiVi 1 


dy \dzdx = 

0 for odd A, 

(18) 

( 


2) exp ( — 

iyz-J S a«a«^ 






W(z,x) 

\ 

W-l 

/ - 








- r 

/ 8 

\i(n-Q 

-2)/ 

8 \2A 







2- E *«y<y«l 


2 Aft 

dy 





-J w(y\z, x ) 

\ i,i=1 

1 

\ 

i=l / 




, 

/ 

Of 

\ iin-a-i.) 

/ « 

\2/n 


0 (A = 




(* 


2 Ay* 

\<-i 

J 

dydzdx = 

1,2,3 

,...)• 


(19) 


According to the lemma proved in the Appendix, the functions within the 
square brackets in the above equations must vanish identically. Hence 


[• / 8 \i(n-q- 2 ) f 

{*- S dy — e\ {...)dy, 

J uj(vlz,x)\ i,j= 1 / J X) 

r l 8 \i(n-q- 1 ) [ 8 \h 

[z- S sfi yi yA ( 2 PiVi) dy = 0 for odd h, 
J ui(viz,x) \ i,i=1 / \i“l ' 

r / 1 \Kn-fl-2)/ 8 \2ft 

(*- 2 a^y<yy) EAy<) dy 

Jw(vlz,x)\ i,i= 1 1 \t=l / 




X) 


(•••)<% (A = 1,2,3,...). 


( 20 ) 

( 21 ) 


( 22 ) 


In order to simplify the above equations we notice that, since the matrix 
|| Xy || is positive definite, it can be thrown into the form CC', where C is a non¬ 
singular real matrix. Using the transformation 




(23) 
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we obtain 

/ o \}(»~<z-2) r 

11- S I dt — e\ 

(24) 

J w(t\z, x) 

\ i=X 7 

J W«k*' 


f / 

1 \ J(»-S-2) 

/ 1 \ft 

(25) 

1 (1- 

-S«! i 

S t7 { \ dt = 0 for odd h, 

J W(tlz,X) \ 

i= 1 / 

\i= 1 I 


r la \i(n-i~ty, 

< 1 \2 ft 

: f {.>.)dt (h = 1,2,3,. 

• ■). (26) 

1 I 1 - s*2) 

S Tih) dt = 

J U)(t\B, X) \ »” 1 / ' 


J W(Uz, x) 


where 

lln.--.Tj n = 




Now W(t | z, x) is the region S (independent of z and the x’s) defined by the 
inequality 

£<?<!. (27) 

i=i 

Hence the integral on the right-hand side of (24) is a numerical constant, say b, 
and a rotation in the space of the t's enables the integral on the right-hand aide 
of (26) to be written as 


11 \h r I a 

St? l-s*l t\ h dt. 

\i=l- / Js\ {-1 / 


(28) 


Hence we have the following equivalents of (24), (25) and (26): 


f 



C 1 1 \ -8-2) 

1-S*? dt = be, 

(29) 

f ( 

J w(t(n,x) \ i**l / 

8 \Km-8-2) 11 \h 

1- S 1?) I S r t tA dt = 0 for odd 

(30) 


t-i / \i=i / 


Kn-a- 2 )/ a \ 2 ft / g \ft 


i 1 -#) 

| ( { Sn^j * = ^^ { ST?j (A =s 1,2,3,...), 

(31) 


where the b h are numbers depending only on the choice of w. 
Equations (30) and (31) may be combined into the following one: 

r j a \l(n-s- 2 ) in \ l i \ 

1“S<! exp - Sr^ \dt=G St? • 

JwCt|*,a)\ 1=1 / \ i = l / \{-l 1 

Now, by (4) and (23), we have - 

& = S tf, 

i=l 

consequently, the region utfis defined by the inequality 


(32) 


(33) 
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Since w 0 is of size e, we must have the same equation (29) when w(t | z, x) is 
replaced by w 0 therein. Hence 

1-S«! *= dt. (35) 

Jrtt k®)\ i-i 7 Ju,,\ i=i/ 

On setting exp - £ dt = O 0 £ r? , (36) 

we infer, with the help of (34), (35) and the lemma proved by P. L. Hsu in the 
Appendix of his paper (1940), that 

«(.S<!)<^o(.S<!). (37) 

Hence, using the transformation reciprocal to (23) to the integrals in (32) and 
(36), we have 

f I a \ 4 (u-g— 2 ) / a \ r 

(z~ £ exp - £ ^yAdy^\ (•••)%• (38) 

J -Uiftila,®) \ 4,J=1 / \ 4=1 / 

Multiplying both sides of (38) by 

K( 1 - p 2 )*™ \ x tj | exp | - \yz -* £ a y 

and integrating over the space If (z, x) and remembering (9), we obtain 

/?(P a XA>< (/>*)• 

Therefore Theorem II is proved. 

I am gratefully indebted to Dr P. L. Hsu for putting this problem before me 
and for his helpful suggestions both in the course of my research and in preparing 
this paper for publication. 


APPENDIX 


Lemma. Let E(x) be the set of points (x lv x 12 ,x m ) for which the symmetric 
matrix ||aty || is positive definite. Then 

f | x t Mx) | exp ( - £ %) dx < co (i,j = 1, ...,q) (1) 

Jw \ 4=1 / 

and 

f ^(cc)exp ( - £ a n x ij\ dx = 0 throughout E(a) (x i} = x }i , a if = a H ) (2) 
J E(x) \ 4=1 / 


imply that 


<fi(x) — 0 almost everywhere in E(x). 



Proof, Suppose that both (1) and (2) are true. Since the matrix ||4+4II> 
where 4 - 1 and 4 = 0 (i 4=j), 4 = 4> positive definite for all sufficiently 
small real 0’s* so, by (2), 


£(x) 


-S% exp - = 0 




i=l 


(4) 


for all sufficiently small real 0's. By (1) the left-hand side of (4) is an analytic 
function of each of the 0’s in the neighbourhood of the imaginary axis. By 
analytic continuation (4) must remain true for all complex 0’s with sufficiently 
small real parts. In particular, 


!b(x) 


M exp (- S % exp If- 1 S fyaJ dx = 0 (4 = y (5) 


\ i°l 




for all real values of h. Hence, by the well-known property of the Fourier 
transform, 

<j>( a:) exp (- S %j “ 0 almost everywhere in E(x), 


which implies (3). 
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(i) A recurrence relation for the semi-invariants of 
Pearson curves 

By M. G. KENDALL 


The Pearson curves are defined by the differential equation 

d y(a+x)dx 

Multiplying by e fx (6 0 + b x x + b t x 2 ) and integrating over the range of the distribution, wo have 


je tm y(a+x)dx = j(b 0 + b t x + b 2 x 2 ) e tx dy 

= C( & o 4- + & a aJ a ) e ta! 2 /] — J ydx + 2b 2 x + t(b 0 + b 1 x + & 2 a: a )}. 


At the oxtromes of the distribution we may suppose the expression in square brackets on 
the right to vanish and hence 


Je‘*y{<x + 6 1 + & 0 « + (l + 2& l i + & 1 f)a; + f> ii te l! }da; = 0. 

The moment generating function of the distribution, M(t), is given by 

M(t) — je ig ydx, 

and hence = [e. ix xydx, etc. Thus from (1) 
dt J 

d 2 M dM 

^ ^ ^ ( a i" d &o 


•( 1 ) 


•( 2 ) 


a linear differential equation of the second order, which may also be regarded as defining the 
Pearsonian system. 

Incidentally, it would be interesting from the theoretical view-point to consider classes 
of frequency distributions defined by differential equations in their moment or semi-invariant 
generating functions. 

So far as I know there is no solution of (2) in ordinary functions which would permit of 
the explicit expression of the co-efficient of i r in M(t); but from a consideration of the co¬ 
efficient of V in (2) we have 

{1 + {r + 2) 6 2 }/^ +l + {®+(r + 1) fril/Ad^o/^v-i = 0, .(3) 


the well-known recurrence relation between the moments of Pearson curves. 

Some simplification of this expression is possible by the choice of a particular origin in 

certain cases. If the roots of , . , , „ . 

ba + b^x + b^x 2 = 0 


are real, it is possible by a real linear transformation to transform the equation defining the 
Pearson curves to one which does not involve & 0 . With the origin defined by this transforma¬ 
tion, we have ( 1 + (r + 2)&J ^ +1 + {a + (r +1) 6J = 0, 


giving 


, _ (a + rbj) (a + f — Ifq)... (a + &i) . 

^ (l+7+l6 1 )(l+r6,)...(I + 5»*)’ 


.(4) 
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Putting K = logM in (2), we have for the semi -invariant generating function 

+ (^) j + (l + 26 g +6 1 0^+(« + 6 i + V) = 0- .(6) 

This is not linear, and it appears therefore that there is no simple recurrence relation among 
the semi-invariants as among the moments. The equation its similar in character to that 
known as Riocati’s ancl the usual way of solving it would be to return to the linear equation 
(■2) from which it was derived. 

Taking an origin at the mean (q = 0) and considering the eo-officient of t r in (5), we have 


fyjrtV+i 

[r 


*V+l , L *V-1 , K t *r-2 , , K r ~l I (1 4.9^ \ Kf — Q 

:T)1 + 6 nri(^ + W-8ri + -" + ^W.| + ( ri + 1 (r~l)!- 0> 

u + (r + 2) b*> x r+1 + rb x K r + rb, ^ { 1 j k % q_i 

+ ^' 1 )q*r r _ a +... + ( r ^ X )q + iq-y + • • • + ( r ~ = 0, .... 


.( 6 ) 


with the initial relation K t = ~bj(l -fi 36 8 ). 

Equation (6) seems to be as simple a recurrence relation as we can expect for the expression 
of a semi-invariant in terms of those of lower order. 


(ii) A comparison of annual and biennial inflorescences of 
Daucus carota (wild carrot) 

By WILLIAM DOWELL BATEN 
Michigan State College 

Introduction 

In 1932 seeds from Michigan and Indiana were gathered from Daucus carota for the purpose 
of studying environmental effects on the numbers of pedicels and bracts per inflorescence 
from plants grown from Michigan and Indiana seeds. In 1933 these seeds were planted in the 
botanical gardons of the University of Michigan in the green house and later planted outside. 
In 1933, 44 % of the plants bloomed; in 1934,17 % of those that did not bloom the first season 
survived the winter and bloomed. Results of this study were published by the present writer 
(1934) in an article entitled "A statistical study of Daucus carota ”, in which the numbci’H 
of pedicels and bracts on annual and biennial inflorescences coming from these seeds were 
compared. At the end of the article K. Pearson pointed out that since the seeds were taken 
from many plants in the wild, some of the seeds might have come from flowers blooming the 
first season and others from those blooming the second soason, that one did not know how 
many annual and biennial seeds came from the two states and that the comparisons might 
not be the same if this was considered. 

To overcome this just oriticism seeds were taken from one plant near Ann Arbor, 
Michigan in 1936. These were planted in 1937 in the greenhouse at Michigan State College and 
later planted outside in rows 3 ft. apart and 3 ft. apart in the rows. During the latter part of 
the summer, counts wore made on plants blooming the first year of the number of branches, 
the number of inflorescences, and the number of primary pedicels and bract per inflorescence 
on the stem and first eight branches below the stem terminal cluster. During the summer 
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of 1938 similar counts were made on the plants blooming the second year. The object of 
this article is to compare the counts pertaining to the annual and biennial inflorescences. 

During the first flowering season 00 % of the plants bloomed. At the end of this season 
the plants which did not bloom appeared to bo in good condition for the coming winter. 
In 1938 biennial flowers appeared much earlier than the annual flowers in 1937. Counts of 
the annuals were mado in 1937 in August and September; counts of the biennials were made 
in July and the first part of August. 

The terminal inflorescences on the stem will be designated by 7', the first branch terminal 
inflorescence by A, the first non-terminal inflorescence on the first branch by A v etc. Branches 
ore considered in descending order below the stem terminal. According to these notations, 
£> 3 represents the third non-terminal inflorescence on the. fourth branch. Umbels in this 
article will always mean primary umbels and pedicels or rays will always mean primary 
pedicels or rays. 

Size op annual and biennial plants 

The following averages pertain to the number of branches and inflorescences (including 
buds) of annual and biennial plants. 



Annuals 

Biennials 

Parts 

(1937). 

(1938) 

Average no. of branches 

16-3 

20-3 

Average no. of inflorescences 

122-7 

282-5 


These averages indicate that the biennial plants were much larger as to number of 
branches and inflorescences than the annual plants. The second year herbs were considerably 
taller than those blooming the first season. 

In 1938 most of the branches used in making the counts had four umbels, whoso parts 
could be enumerated; in 1937 very few of tlieso had four umbels whicli were mature enough to 
use. Very few of the first branchos belonging to 1937 plants produced more than two non¬ 
terminal umbels; a good percentage of corresponding branches of 1938 plants possessed more 
than two. Counts were made on seventy-seven jdants during the first season and on seventy- 
six during the second. In the second summer there were several plants with more than 500 
inflorescences and one with 796; the largest in 1937 had 183. 

In 1937 there was 37-7 % of the herbs with at least eight branches; in 1938 there was 
72-4 % with at least eight similar branches. In the first flowering season 46-8 % of the plants 
had at least six branches; during the second season 90-8 % had at least six branches. There 
were 74-0 % of the annual plants with at least four branches and 97-4 % of the biennials with 
at least four. Those figures show that the biennial plants were more completely filled out than 
the annuals. 


Size of inflorescences 

Table 1 contains averages pertaining to the number of bracts per umbel on the stem and 
the first three branches. On the average the number of bracts on tho stem and branch terminal 
clusters of biennials are significantly larger than similar annual clusters. The average size 
(in number of bracts) of stem umbels for annuals was 10-9 bracts; that for biennials was 11-9 
bracts. The, average number of bracts per branch terminal was less than 10-3 bracts during 
1937 and greater than 11-4 bracts during 1938. These figures and figures pertaining to the 
first eight branches indicate that the averages of the number of bracts on the majority of 
the biennial clusters are significantly larger than similar averages with respect to annual 
clusters. 
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Table 1. Averages and standard deviations pertaining to the number of bracts 
per umbel on the stem and first three bramhee 

1937 



i 1 

A 

A 

^-2 


B 

B 1 

B s 


G 

Ct 

c a 

C 8 

Number 

77 

77 

55 

18 


76 

69 

49 


68 

62 

39 


Average 



9-4 

8-6 

— 

10'4 

9'6 

9-6 

— 

10-3 



-- 

Standard 

deviation 

1-22 

1-39 

1*22 

1-01 


1 ‘65 

1-33 

1-30 


M2 

1-21 

1-27 



1938 


Number 

— 

70 

76 

33 

— 

27 

7 

76 


29 

— 

15 

71 

30 

26 

21 

Average 

11-9 

11 ’4 

9-8 



11-8 





10-2 

10-9 

11-2 

Standard 

deviation 

1-36 

120 

M3 

1-63 

1-77 

1-20 

1*41 

1-44 

M6 

M0 

132 

1-46 

1-23 


Biennial stem terminals had on the average significantly more pedicels than stem 
annuals; these averages are; 

Annuals Biennials 

56-6 pedicels 67-8 pedicels 

Branch terminals of biennials have on the average significantly more rays than similar ones 
on annuals. Fig. 1 allows the eye to see at once how these averages compare; the heights of 
the bars on the left represent the averages for the annuals. The bars on the left are shorter 
in every case. 



Fig. 1. Averages pertaining to number of pedicels per branch terminal 
umbel for annuals and biennials, a. annuals; b, biennials. 


Many of the averages of the numbers of pedicels on non-terminal biennial clusters are 
significantly larger than those belonging to corresponding annual clusters. The above indi¬ 
cates that biennial inflorescences (in number'of bracts and pedicels) are significantly larger 
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than corresponding annual inflorescences, showing again that Daucus carota herbs blooming 
the second season are on the average much larger than those blooming the first. 

On examining average number of pedicels per umbel it is found that branch non-terminal 
umbels have on the average a smaller number of pedicels than the corresponding branch 
terrfiinals; for example, A 1 and .4, are significantly less (in number of rays) than A. This was 
true for the other branches. The 1938 averages for C, C v G t and <7, are as follows: 

C Cj 0, C s 

61-7 rays 46-3 rays 48-3 rays 61>1 rays 

Similar figures were found for the other branches. The averages pertaining to pedicels are 
shown in Table 2. 


Table 2. Averages and standard deviations pertaining to the number of pedicels 
per umbel gn the stem and first three branches 


1937 



T 

D 

A 


H 

B 

Br 

b 2 

Bs 

C 

Oi 

o, 


Number 

77 

77 


18 

a 

76 

59 

49 


68 

52 

39 

■■ 

Average 

55-6 

53-2 

44-6 

40-2 


54-3 


46-5 

— 

55-6 

46-8 

45-6 

— 

Standard 

deviation 


10-98 

7-40 

6-96 

I 

11-32 

8-81 

9 37 


12-20 

8-70 

8*18 



1938 


Number 

76 

76 

33 

27 

7 


30 

29 

15 

M 

30 

26 

21 

Average 

64-26 


ifafl 

48-07 

48-43 

60-77 

47-23 

49-45 

47-13 


45-33 

49-32 

51-14 

Standard 

deviation 

13-10 

9-78 

9-98 

6-27 

8-79 


9-93 

10-31 

10-60 

■ 

11-62 

10-07 

9-07 


Correlation 

The Pearson linear correlation coefficient between the number of bracts and the number 
of rays for various umbels for annual and biennial umbels are as follows: 


Umbel ... 

T 

A 

B 

O 

Annuals 


0-410 

0-496 

0-571 

Bionnials 

0-665 

0-612 


0-549 


All of these coefficients are significantly different from zero, showing that there is a definite 
relation between the number of bracts and the number of rays. There are no significant dif¬ 
ferences between the correlation coefficients pertaining to annuals and biennials except that 
for T which is barely significant at the 5 % level. These values suggest that the size of the 
plant and season do not effect the relation between the number of bracts and rays per umbel. 
Similar figures were found in other investigations of this species (Baten, 1934). The position 
of the umbels on the plant also does not affect the relation between bracts and rays. 
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The coefficients of correlation between the number of bracts on T and on the other 
clusters pertaining to annuals and biennials arc about the same and are significant, indicating 
a real association between bracts on stem and branch terminals and similarly for rays. The 
values of r fs are: 



Bracts 

Rays 

Annuals 

Biennials 

Annuals 

Biennials 

r TB 

0-644 

0-696 

0-849 

0-741 


Sizo of herb and season have no effect .on the relation between bracts and rays on T and on 
A, B and C. 

The amounts of dependence of the number of bracts on branch non-terminals have on 
the number of bracts of terminals for the first three branches were obtained by the correlation 
coefficients between these respective numbers. There are no significant differences between, 
these coefficients, suggesting that the size of plants and seasons do not affect tho relation 
between the number of bracts and rays on branch terminals and branch first non-terminals. 
This also was true for rays. 

The following figures are the coefficients of correlation between the number of bracts and 
rays on first and second branch terminal inflorescences. 



Bracts 

Rays 

Description 










Annuals 

Biennials 

Annuals 

Biennials 

r AB (interclass) 

0-748 

0-734 

0-916 

1 

r AB (intraclass) 

0-701 

0-710 


0-786 


These values indicate no significant differences between the correlations pertaining to annual 
and biennial inflorescences. They do suggest a rather high correlation between the number of 
bracts and rays on first and second branch primary umbels, 

The relation between floral parts on B and T and A is manifested by the following 
multiple and partial correlation coefficients. 


Description 

Bracts 

Rays 

Annuals 

Biennials 

Annuals 

Biennials 


■ 

■ 

0-796 

0-666 

0-683 

0-662 

1 

■ 

■ 1 


Again there are no significant differences between these coefficients indicating that the 
amount of relationship remains the same between these floral parts pertaining to annual and 
biennial inflorescences. 
















Suggestions tor further study 


It might be argued that biennial plants should naturally be larger in every way since 
these plants had a longer time in which to establish themselves than the annuals; that the 
root system of the second season plants are much better for supporting the plants than those 
of the first season. This may be true. To overcome this criticism and to make more reliable 
comparisons between annual and biennial plants and inflorescences it might prove of real 
value to secure seeds from one plant as done in this study, save seeds from the annual and 
biennial flowers, and plant these seeds under the same environmental conditions and then 
make comparisons between the counts made in this study. Seeds should be planted in the 
fall and in the spring. Investigations along these lines may produce more interesting results 


Summary 


This study has shown that: 

1 . The average number of branches on biennial inflorescences of Daum carota is larger 
than the average number on annual inflorescences. 

2 . The average number of inflorescences on biennials is larger than on annuals. 

3 . The average number of bracts per biennial clusters is larger than the average on annual 
clusters. 

4 . The average number of primary rays on biennial umbels is larger than that on annual 
umbels. 

6 . The correlation coefficient between bracts and rays is about 0*60 for annual and 
biennial clusters. 

6 . The size of plants and seasons (first and second) do not affect the amount of correlation 
between floral parts on stem terminals and branch terminals. 

7. The amount of correlation between certain floral parts on branch terminals and non¬ 
terminals is about the same for annuals as biennials. 

8 . The amount of correlation between bracts and rays pertaining to first and second 
branch primary umbels is about the same for annuals as biennials. 
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THE LAWS OF CHANCE, IN RELATION TO THOUGHT 

AND CONDUCT 

INTRODUCTORY, DEFINITIONS AND FUNDAMENTAL 
CONCEPTIONS 

BEING THE EIRST OE A SERIES OE LECTURES DELIVERED BY 
KARL PEARSON AT GRESHAM COLLEGE IN 1892 

[It >s just fifty years since Karl Pearson took up the part-time appointment of Lecturer in 
Geometry at Gresham College in the City of London. This appointment, which he held 
during tho years 1891-4, involved the delivery of certain courses of public evening lectures. 
His first course on ‘The scope and concepts of modem science’, commenced on 3 March 
1891; much of its material was afterwards published as The Grammar of Science. Later 
series of lectures dealt with ‘The geometry of statistics’ and ‘The laws of chance’. The 
lecture printed below was found among Pearson’s papers; it was delivered on 1 November 
1892 and wiia the first of a series devoted to the theory of probability.— Ed.] 

In everyday life we feel, and justifiably feel, irritated with the man who is 
perpetually asking us to define the words we use. We are wont to reply that we 
use our terms in the ‘ ordinary 1 or ‘ customary ’ sense. As a general rule mankind 
understand each other in ordinary intercourse and do not stop to discuss the 
meaning of words. But in important and delicate business or in legal contracts 
the accurate definition of the words employed becomes of the utmost weight. 
Even more urgent still is clear definition in the matter of scientific investigation. 
It will not do here to appeal to that vague or floating sense of a word, which is 
termed the ‘ordinary’ or ‘customary.’ one, for hardly any two persons use the 
same abstract word for precisely the same range of ideas. What is still more 
remarkable is the change which the meaning of words undergo in a few genera¬ 
tions, so that even the language of our grandfathers requires to be read in the 
light of their (and not our) customary use of words. Take words apparently, so 
simple as Nature, Right, Belief, Law, Chance: what a gulf separates the field of 
ideas we associate with these terms in 1892, from that which was their ‘ ordinary ’ 
or ‘ customary ’ value some century ago, i.e. in the days of the French Revolution! 
Or, again, how different is the modern scientific use of the words ‘ nature ’ and 
‘ law ’ from the sense often to-day put upon them in popular or current language! 
Indeed I am inclined to think that the irritating person, who insists in everyday 
life on definitions, is after all rather a social blessing than a social nuisance—for 
fir my experience 90% of the wordy discussions which arise in ordinary life are 
due to the fact that the disputants have not first fixed the sense in which they 
are using some fundamental conception. 

In the present course of lectures, which will deal with the theory of probability, 
with chance, luck and the vexed question of the scientific measurement of belief, 
we shall have to be especially careful that we clearly define and appreciate our 
Biometrika xxxh 7 
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fundamental conceptions. This insistence on definition must be the starting-point 
of any really scientific discussion, and I want to urge you all to start the study 
of this or any other subject by trying to clearly define its scope and terms. You 
must in this respect ‘list to what the friar preaohes and not to what he does for 
according to a sharp-eyed reviewer I have myself been guilty of publishing a 
book, in which no definition of chance itself was given! I will endeavour to 
supply that omission in to-day’s lecture. But first I want to point out the rela¬ 
tion between the subject of my present course and the topics of the two earlier 
ones on the Fundamental Concepts of Science and on Statistics. The relationship 
is a very close one indeed, far closer than might be imagined on a cursory 
examination. We shall find that statistics are the practical basis of much, if not 
all scientific knowledge, while the theory of chance is not only based on past 
statistics but enables us to calculate the future from the past, the very essence of 
scientific knowledge. There is a close relation between provable and probable ; the 
analysis of 20,000 tosses of a coin will help us to penetrate into the very laboratory 
of Nature whose complexity presents us with results strikingly akin to those of 
a game of chance; while the record of a month’s roulette playing at Monte Carlo 
can afford us material for discussing the foundations of knowledge. That things 
apparently so diverse should be so closely related may strike some of you as 
paradoxical, and indeed the ground we are to venture upon abounds in diffi¬ 
culties and pitfalls. It is one where criticism and controversy have been very 
rife, but at the same time have been fruitful of results and have contributed'to 
clearness of thought. We shall find well-marked divergencies of opinion, charac¬ 
terizing two different schools, which push to extremes in different directions. 
Based on the-researches of Laplace and Quetelet, we find De Morgan, John 
Stuart Mill and Stanley J evons pushing the possibilities of the theory of probability 
in too wide and unguarded a manner; while in the opposite camp we find George 
Boole and Dr Venn taking a severely critical and in some respects perhaps too 
narrow view of them. As in many other cases the safe roacj is probably the middle 
road, and this road is that which .1 conceive Prof. Edgeworth of Oxford to have 
pointed out. For those of you who may have time for reading I would strongly 
recommend a comparison of Chaps, x-xn of Stanley Jevons’ Principles of 
Science with Chaps, vi-xi of Dr Venn’s Logic of Chance and Prof. Edgeworth’s 
Philosophy of Chance published in Mind for 1884. I shall refer to the opinions 
of these writers in the course of our work, but you would find the subject of 
chance as treated by them enticing in the extreme, and they will give you far 
more amply than I can do in these lectures the various features of the controversy. 

While dealing with the subject of books I may also refer to: 

De Morgan; Formal Logic (1857). Here Chaps, ik-xi are closely connected 
with the topics of our. first two lectures. 

De Morgan: An Essay on Probabilities (1838). This is still a useful and sug¬ 
gestive little book, although it requires some mathematical knowledge. 
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Whitworth: Choice and Chance (3rd ed. 1878). An excellent book with which 
to approach the elements of the mathematical theory. 

Wbstergaard Die Grundzuge der Theorie der Statistic (1890). By far the best 
textbook on the relation of statistics and. probability for those who read 
German. 

Now I want to restate in the first place some of the conclusions I placed 
before you in my first course of Gresham Lectures. I do not want you now—any 
more than I did then—to accept those conclusions as your own but rather to 
probe and investigate them for yourselves, and thus ascertain whether they 
form a basis sufficiently sound for the superstructure placed upon them. The 
conclusions to which I want to draw your attention are those concerning the 
material of science, scientific law and cause and effect. In the first place the 
material of science consists of certain groups of sense-impressions, which we 
term phenomena and in which we mark not only a certain permanency but a 
routine. When we find a certain sequence of sense-impressions frequently re¬ 
peating itself, we speak of any antecedent sense-impression as a cause, any 
subsequent one as an effect. A, B, C, D, E, F being a succession of sense- 
impressions, which repeats itself, A, B, C, D, E are all termed causes of the effect 
F. A scientific law or formula is a statement which enables us to resume or 
describe in brief language a routine sequence—or many such routine sequences— 
of causes and effects. As I pointed out to you, a scientific law does not enforce a 
sequence, it merely describes what-takes place. No law of gases causes or enforces 
the boiling of a kettle of water, when placed on the fire; it merely describes how 
it boils. How then do we know that a kettle of water will boil, if placed on the 
fire? The answer is a very simple one, our knowledge of what will happen is 
based upon past experience. Statistics of past experience, our own or that of 
other men, are the basis of our knowledge of all cause and effect, of all know¬ 
ledge of phenomena. Here you have the kernel to statistics as the basis of 
knowledge. The statistics are formed in a rough practical manner, but are none 
the less real for all that. Take any sequence of phenomena such as a kettle of 
water boiling which has been.long enough on the fire. We have behind us the 
invariable experience that kettles in like positions do boil, and we say we know 
that this kettle will boil. If it does not we expect that some portion of the 
customary sequence fails, the fire has gone out, there is no water in the kettle, 
or there is somewhere a breach in the ‘ group of causes ’. But in our statement 
about the kettle there are really two important factors, there are the statistics 
of past experience, and the assumption that these statistics will apply to the 
future. There is no scientific reason why the same groups of causes should always 
be followed by the same effect. Indeed, a distinguished American mathe¬ 
matician, Mr 0. Pierce, has gone so far as to support the view that the causes, 
A, B, C, D, E, may be followed by F or G, indifferently. There is no logical or 
intellectual proof that like causes will be followed by like effects. It is purely a 
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result of experience. Statistics show us the prevalency of routine in the past, and 
these statistics are the first basis of our knowledge. That what has held in the 
past, will hold for the future, is again a statement for which there is no proof; 
it is the outcome of our experience of what has happened in the pasts, which 
were at an earlier date futures. Hence our inferences with regard to natural 
phenomena are essentially based on statistics—namely statistics of what has 
happened in the past, and the experience that within certain ranges the statistics 
of the past repeat themselves in the future. Now I want you to grasp this.point 
very clearly, for we are coming close to the relationship between statistics, 
knowledge, belief and chance. What do we mean when we say that 106 boys are 
born as compared with 100 girls 1 Or when we assert that such will take place 
next year"? Why simply this that the statistics of past years for a very great 
number of births give us boys and girls repeatedly in these proportions, and 
further experience—in other words statistics again—shows us that such 
statistical ratios do not change suddenly and abruptly, the results calculated for 
a period of four or five years, hold very closely for the following four or five 
years. Or, again, when we say that we know that the sun will rise to-morrow— 
we are just as much appealing to past experience of the action of the sun and 
past experience of the occurrence of routine, as when we appeal to the statistical 
appearance of births. The law of gravitation does not enforce the rising of the 
sun, it is merely a scientific description of what we observe in the motion of the 
planets. Suppose the sun had not risen on one well-authenticated occasion in 
our experience, and on one only, we should then be slightly less confident in our 
assertions as to its appearance to-morrow. Our knowledge would then have been 
weakened down into some very strong form of belief, The more frequently the 
sun had omitted to rise, the less strong would be our certainty with regard to its 
conduct to-morrow, until we passed through every shade of belief to disbelief 
itself. Or let us take a more tangible and possible case. A friend is leaving us, 
say in Chancery Lane at 4 o’clock in the afternoon, and we tell him that he will 
find a Hansom cab at the Fleet Street comer. There is no hesitation in our 
assertion, We speak with knowledge, because an invariable experience has shown 
us Hansom cabs at 4 o’clock in Fleet Street. But given the like conditions 
within reach of a suburban cab-stand, and our statement becomes less definite. 
We hesitate to say absolutely that there will be a cab: ‘ You are sure to find a cab ’, 
‘I believe there will be a cab on the stand’, ‘There is likely to be a eab on the 
stand’, ‘There will possibly be a cab on the stand’, ‘There might perhaps be a 
cab’, ‘I don’t expeot there’ll be a cab’, ‘Its very improbable’, ‘You are sure not 
to find a cab ’, etc., etc. In each and every case we go through some rough kind 
of statistics, once we remember to have seen the stand without a cab; on occa¬ 
sions few and far between, * perhaps on an average once a month ’, ‘ perhaps once 
a week’, ‘every other day’, ‘more often than not there has been no cab there’. 
Certainty in the case of Fleet Street passes through every phase of belief to dis- 
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belief in the case of the suburban cab-stand. If once a month is the very 
maximum of times I have seen an empty cab-stand, my belief that my friend 
will find a cab there to-day is far stronger than if I have seen .it vacant once a 
week. A measure of my belief in the occurrence of some event in the future is 
thus based upon my statistical experience of its occurrence or failure in the past. 
When in a wide range of experience there has been no experience of failure, then 
as in the case of the cab in Fleet Street, or in the case of the sun rising to-morrow 
our belief becomes so strong that we speak of knowing. But all this knowing 
really amounts to is a very high, or even the highest possible, degree of probability. 
I know that the three angles of a triangle together make two right angles, for this 
lies in my definition of triangle, and belongs to the field of mental conceptions 
and not to physical phenomena. But of the physical universe I can only say I 
believe such and such things will occur, and the degree of my belief is measured 
in a rough approximate way by the statistics of past occurrence and failure. 

We can see this better, I think, by returning to the definite case of the cab 
on the stand. Once a week on the average of a long experience I have seen the 
stand empty. Thus for every six occasions there is a cab, there is one occasion 
that there is not a cab. Had I sought for a cab at the given hour for a long 
period I should have been successful six times in every seven. We then define 
the ratio of the number of successful instances to the total number of occasions 
as the probability or chance of finding a cab—in this case the chance is 6/7. Thus 
the chance of an event is the numerical measure of past experience. It is based 
essentially on statistical information. How wide must be the range of information 
on which the chance is based we will consider later, for it involves very many 
important points. At present we have the following simple rule :■ Taking the 
statistics of the occurrence, find the number of favourable instances and divide 
them by the total number of instances and this is the chance of the event. 
Returning to our cab-stand, suppose that only once in four weeks I have seen 
it empty at 4 o’clock on the average, then the chance of the event, finding a cab 
at 4 o’clock is 27/28—i.e. in the long run 27 favourable instances per 28 
occurrences. 

Now, if I have found only one failure in 28 occurrences my hope of finding 
a cab on a particular occasion—my belief in there being a cab—will be far 
greater than if I have-found a failure once in 7 occasions. Thus my belief is in 
some way related to the chance; if I know the chance is greater, my belief is 
greater. Prof. De Morgan has asserted that the proper measure of belief is 
chance, and according to him my belief in the two cases cited above would be 
as 6/7 to 27/28, or as 8 to 9. He thus reaches an exact numerical appreciation of 
belief, what might be termed a scientific measurement of belief. 

This view of De Morgan’s has been severely criticized by Dr Venn. He asserts 
that chance is something objective or physical—is based on statistics of the 
occurrence of a physical phenomena—while belief is something psychical and is 
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largely determined by the emotional and nervous temperament of the individual 
man. In other words it is subjective and not objective. According to Dr Venn 
probability deals with the laws of things, while according to De Morgan prob¬ 
ability has to do with the laws of our thought about things. 

Now I think we must agree with Dr Venn that it is impossible to set an 
absolute numerical value upon the beliefs of human beings in practical life. No 
one will venture to say that one of his beliefs is exactly nine times as strong as 
another. Perhaps the only practical measure we can form of the strength of 
beliefs is the readiness of men to act upon them, and the impetuous or credulous 
man will risk as much where the chance is small, as the sluggish or sagacious 
man where the chance is large. At an important crisis he will risk finding a cab 
on a chance which would have induced the prudent man to order one beforehand. 
Clearly then as applied to the beliefs of practical men in actual life, Dr Venn is 
right in asserting against De Morgan that we cannot put an exaot numerical 
value on belief. On the other hand I think we must question whether chance 
can conveniently be treated as peculiar to things. The means by which statistics- 
are taken in practical life are human and they become subjective and individual 
in the process of taking and applying them. Besides this the statistics on which 
the chance may be reckoned are frequently at the option of the particular indi¬ 
vidual and the chance at once becomes subjective and peculiar to him. Let me 
point out what I mean. A man is tossing a coin in a railway carriage, a country 
lad in the oarriage, judging by his experience of coins in the past, is ready to believe 
that the chance of a head is 1/2, i.e. that once in two occasions in the long run it 
will come down head. A scientific man (also without guile!) who has made 
experiments in tossing coins knows that every coin has a slight bias, and that 
there is in all probability a slight fraction of a per cent more heads or tails in the 
long run in the tossing of this particular coin. A man of the world knows that 
the coin-tosser is a swindler, and judges that his coin is loaded, so that the 
chance that it comes down head is very far from a half. And the coin-tosser 
himself? Well, he has no statistics at all of the conduct of this particular coin— 
it may be true or biased or loaded—but being an adept in tossing he can bring 
it down head or tail as he pleases. What are we to say is the chance that this 
coin will come down head? We have no statistics whatever of what happens 
when swindlers toss coins, which may after all unknown to them be loaded! 
Are we to say that the chance is an unknowable quantity, and that we cannot 
make any application of the theory of probability? I am inclined to think this 
would unduly narrow the field of our science. It seems to me that we can and 
should apply our theory to the chances subjectively estimated of each occupant 
of the railway carriage, These chances are based on the subjective experience of 
each individual with regard to coins under like conditions, and they certainly 
are more concerned with the laws under which people think about things, than 
with the laws of things themselves. If the country lad bets on a head the chance 
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of a head is for him one-half, for the scientific gentleman it must be a shade less 
or a shade more, for the man of the world it is a very small chance indeed, for 
the swindler it may be a certainty, if he means the lad to win on the first occasion, 
in order to excite him to betting heavier amounts. Now the beliefs of these four 
persons clearly differ in strength and their relative proportions are closely re¬ 
lated to the individual measurement of the chance, to what we may term the 
subjective chance. I am inclined to think with De Morgan that belief varies very 
closely with the subjective chance, but, this subjective chance depends upon the 
statistics of individual experience, and may differ widely from what we may 
term the objective chance, or the chance based upon statistics of the actual event 
in question and independent of the individual calculator. 

Turn (I hope, for the last time) to our cab-stand and the chance of finding a 
cab on it. Accurate statistics may have been taken of the absence of cabs upon 
it for a long period, perhaps, two or three years, For our present purposes these 
may represent the statistics for the calculation of the * objective chance ’. I may 
have observed the cab-stand, not very regularly, for a few months, and my result 
is: empty about once a week at 4 o’clock; a friend knows nothing about this 
particular cab-stand, but has formed statistics of suburban cab-stands in general; 
while another person without paying special attention to suburban cabs has 
formed pretty precise ideas as to cab-stands in London as a whole. The statistics 
of suburban cab-stands in particular, or of London cab-stands in general may 
be wide and accurate, or may be individual and approximate; in either case it is 
a subjective act which classes the particular cab-stand under either of these 
headings, the particular chance selected is the result of individual experience or 
subjective choice. If we ask what is the relation between subjective chance and 
objective chance, I think we can safely say, that while the two often differ 
widely, yet the more deep a man’s experience, the more thorough his observation 
and his knowledge of phenomena, the more closely his subjective statistics will 
fit the objective statistics. He will never, perhaps, make the two coincide, but 
in the long run of practical life his mistakes will be few and tend to balance each 
other; his subjective chance will approximate to the objective chance in Dr 
Venn’s sense. He will know what classes of statistics to apply to individual cases 
with the best results, or in ordinary language, ‘he will be a judge of men and 
things ’. If experience of life and acquaintance with fact lead a man’s subjective 
appreciation of chance to approximate to the objective value of chance, may we 
not say that, if belief varies with a man’s subjective view of chance, then 
ultimately it is objective chance which governs belief! 

There is a difficulty here which is I think sometimes overlooked, but which 
seems to me fundamental and we must regard it with a little care. I take a coin 
and I say the chance, when it is tossed, of a head is one-half. Now what exactly 
does this mean! One or other of two things, either: 

(1) I have tossed this same coin 10 or 20,000 times and found practically the 
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same number of heads and tails. Here the subjective and objective chances are 
practically identical. Or: 

(2) I have not tossed this special coin at all, but judge it to be like other 
coins, of which my own rough experience, and that of other men, presents 
practical statistics of the equality in the number of heads and tails obtained in 
a great number of tosses. 

Here the subjective and objective chances may or may not be the same, for 
after all the coin may be loaded or even a double-headed one. But in this case 
also experience of the coin will ultimately bring the subjective and objective 
chances to the same value, be it 1/2 or otherwise. Now let us go a stage further 
and suppose that experience has brought the subjective appreciation of chance 
to its objective value. Would that objective value be a measure of my belief? 
Now there is an assumption here, which I have before referred to, and want you 
now to particularly notice. The chance is really based on past statistics, it is the 
number of successes observed by the total number of trials. 20,000 times the 
coin has been tossed and 10,000 times—within a few units, perhaps—heads 

have appeared; the chance of head is or 1/2- But this is not all we mean 

when we say the chance is a half. We refer to the future as well as to the past, 
and we assume that if we were to throw the coin an indefinite number of further 
times, there would he in the long run as many heads as tails. Here is the 
assumption we make when chance is taken as the basis of belief as to the future. 
In other words the statistics of past experience are assumed to be identical with 
the statistics of what will happen in the future. When I say that the chance of a 
head is one-half, that statement is meaningless, if it be considered as referring 
to a single toss, it refers to what I believe will happen on the average in a very 
great number of future throws—i.e. a practical equality of heads and tails. This 
belief is based on two elements, first, statistics of past experience as to tossing 
and secondly the permanence of statistical ratios. 

This latter is a most important element and one which in reality forms a 
large factor of belief. Let us bring this out more clearly by a comparison of one 
or two cases. Statistics of tosses show a coin to be a true coin, to give in the long 
run head as often as tail—chance of a head therefore 1/2. 

Statistics of a certain country show that of 206 births 10(5 are boys—chance 
of a boy being born 106/206. My statistical experience of a certain cab-stand, 
shows that on an average there is no cab there once a week at 4 o’clock—chance 
of a cab 6/7. 

Now before I apportion my belief of what will happen in the long run in the 
future in these several cases, I have to consider the permanence of these statistical 
numbers, and the only way I can do this is by examining statistics as to the 
permanence of similar numbers. The factor of my belief depending on the 
permanency of the chances is itself rooted in statistics. How often have I found 
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the change given by statistics to be constant, how often to change? What indeed 
is the ‘chance 1 of a chance changing? 

A coin has given as many heads as tails in the past, why should it not now 
begin to give a vastly greater proportion of heads? The appeal is again to 
experience, and experience tells us that, if a coin be not battered, bent or altered, 
it maintains indefinitely the same chance of a head. 

On the other hand experience tells us that while in vital statistics there is 
scarcely ever an abrupt change, ratios do alter slowly and gradually, the chance 
of a boy being born as calculated from the last few years may hold for the next 
few, but it may vary from decade to decade and century to century. 

Still more may the chance of finding a cab oh the stand vary. I may have 
carried out my observations for two or three months, but the completion of a 
new line of railway or a Licensing Act may make a sudden breach of continuity, 
there is much less permanence in statistics of this kind, than in those of coins 
or babies. Clearly the chance determined from past statistics is not the only 
factor in apportioning my convictions as to the future appearance of heads, boy 
babies, and cabs. The chance of the statistics remaining in the future what they 
have been in the past must also be considered and be shown to be the same in all 
the cases where beliefs are compared. 

Thus, I think, we must agree with Dr Venn, although partly on other grounds, 
in recognizing that the chance of an event is not an accurate numerical measure 
of our belief in its occurrence, but on the other hand we may go so far with De 
Morgan as to assert that our belief is strengthened or weakened when the sub¬ 
jective chance based on our personal knowledge or experience—or on the rough 
and ready statistics of practical life—is increased or decreased. 

We may even go a stage further and construct a model universe in the 
following manner: Suppose a world in which men had such width of experience 
that their subjective appreciation of chance was equal to its objective value, and 
further that in this ideal world statistical ratios retained a permanent value— 
in the manner in which we actually find they do in games of chance—then in 
such a scientific ideal world chance might fairly be considered to measiue belief. 

It may be asked what is the use of such an ideal model as this? In the flesh 
and blood men of actual life with their prejudices and half-knowledges the sub¬ 
jective appreciation of chance diverges often widely from its objective value; 
further in this real world! few statistical ratios are actually permanent, they vary 
not only with time, but with the range and limits of our statistics. What then 
can the use of our model be? Well, of much the same use as the political econo¬ 
mists’ model of society governed by the laws of exchange or value, or the 
physicists’ molecular model of nature. Neither is true to reality, but both serve 
with certain reservations to describe in broad terms the general facts of economic 
and physical phenomena. In the same manner, because in a rough and approxi¬ 
mate way men’s subjective appreciation of chance does tend in the practical 
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experience of life to approach the objective value—and because in a great 
variety of oases chances calculated on past experience are found to remain 
permanent in the immediate future—so men’s beliefs as evidenced especially 
in conduct do vary with chance; and if chance be not a scientific measure of 
belief, it is yet in the average of men a rough and ready means of gauging, more 
or less accurately, the relative strength of convictions. 

It may seem strange to some of you to be told that chance is the measure of 
past experience. At first there may appear to be a very considerable difference 
between the chance that a boy or a girl will be born and the chance that a head 
or tail will turn up. We are quite ready to admit that statistics are needful in 
order to determine whether more boys or girls are born and so to determine the 
chance of a boy or girl birth. But we are not inclined at first to admit an equal 
necessity in the case of the coin. We aTe inclined to argue that ‘We see no reason 
why head should occur more frequently than tail’ and then convert this into 
‘There can be no reason why head should occur more frequently than tail’— 
and then ‘Head and tail must be equally frequent’. You will see the weakness 
of this argument at once by applying it to the ease of boys and girls. ‘We see no 
reason why more boys should be born than girls.’ ‘There can therefore be no 
reason why more boys should be born than girls, ’ and finally ‘ Ho more boys are 
born than girls’. Here statistics step in and upset all our preconceived notions. 
In fact all arguments of this kind remind us of the old mediaeval notions of 
physical science, which began by arguing as to what nature ought to do, instead 
of patiently observing what she did do. We may see no reason why head should 
occur rather than tail. But if there were not a very definite reason why head 
or tail should have the preference in each individual throw, then we may be 
quite sure that the coin would balance on its edge and exhibit neither head 
nor tail. 

Mere inspection of a coin would certainly not suffice to tell us that the 
■ chances’ of head and tail are equal. The head is different in shape and appear¬ 
ance from the tail, and the coin may really be biased by-this. Let us get over 
this difficulty by taking a perfectly uniform disk absolutely alike on both sides, 
and let us imagine it thrown up so that neither side has any advantage either on 
leaving our hand or on reaching the ground. Can we say that the chances of 
either side are equal without any appeal to experience—to statistics ? The fact 
is that if the two events were absolutely balanced in this manner, if not only we 
saw no reason why one should occur more than the other, but there was no 
reason, then there would be no chance of either event occurring at all. In our 
experience of nature there is no such thing as chance of this kind. The moment 
a coin or a die leaves the hand, its fate is really settled and there is no field for 
the ‘play of chance’ in the obscure sense we have just been referring to. The 
mechanical causes are perfectly definite and the occurrence of head or tail, 
ace or deuce, absolutely certain. It is quite true that these mechanical causes 
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are far too complex, too evenly balanced and too incapable of measurement 
for us to mechanically describe what must happen, and so predict head or tail. 
Mechanically the one or other is predetermined, but the multiplicity of causes 
varying so slightly and yet so effectively from throw t<? throw leaves us in ignor¬ 
ance as to the result. If we are merely ignorant as to a result which is mechanically 
perfectly certain, what is the meaning of chance in physical nature ? Simply this 
that we aid our ignorance by an appeal to past statistical experience. The chances 
of a coin falling head or tail being equal does not depend on my ignorance of what 
will occur, or on my seeing no reason why head more than tail should occur, but 
on my experience of the statistics of tossing coins. This experience is really summed 
up in the symbolic slang ‘ a toss up ’—as an expression for an equality of chances. 
I know from my own personal experience and from the common habits of men— 
as well as from the statements of gamblers and others—that loaded coins are 
not of frequent occurrence. Without this experience I could predict nothing of 
the tossing of a coin, it might invariably come down 99 % head; or having fallen 
on the first occasion head or tail, that fall might in itself determine what it 
would do on the second occasion. 

What I have said of tossing a coin holds good for the drawing of black and 
white balls out of a bag. It might seem at first sight that if 60 white and 50 
black balls were put into a bag and well mixed, then, each ball being replaced 
after the drawing, as many white as black balls will be drawn in a large number 
of trials. In other words the chances of drawing white and black balls are equal. 
But here again, if our statement is really to mean anything we must be appealing 
to some rough experience of the conduct of balls in bags. It is conceivable that 
the hand might have a preference for black balls, or that white balls would have 
a preference for each other and the bottom of the bag. If it be objected that the 
hand does not detect colour difference, and that gravity acts equally on equal 
balls if they be of different colours, we are at once appealing to a wide statistical 
experience resumed in certain fundamental laws of nature. We have left at once 
the shaky ground of subjective reasoning, and turned to statistics. 

But even in these cases direct experiment comes to our aid and provides the 
statistics, which are only roughly embodied in the everyday experience and 
opinions of mankind. Thus: 

Btjffon tossed a coin 4040 times, there resulted: 1992 heads, 2048 tails, or 
49 % heads, 51 % tails. 

Quetelet made 4096 drawings out of a bag containing an equal number of 
black and white balls, there resulted: white balls 2066, black balls 2030, or 
50-4% white and 49-6% black. 

Me Griffith, one of my students, has kindly tossed a penny 8178 times and 
there resulted: 4092 heads, 4086 tails, or 50-04% and 49-96% tails. 

W ester©aari) made 10,000 drawings out of a bag containing equal numbers 
of red and white balls well shaken before each drawing after the replacement of 
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the previously drawn ball. He obtained: white balls 5011, red balls 4989, or 
50*11% white and 49*89% red, 

Your present Lecturer tossed (as a holiday task) a shilling 24,000 times. 
He obtained for the first 12,000 tosses: 5981 heads, 6019 tails, or 49*84 % heads, 
50*16% tails; second 12,000 tosses: 5992 heads, 6008 tails, or 49*933.% heads, 
50*067 % tails. In both series there is a balance in favour of tails: in the first 
of less than 1/6 %; in the second of about 1/15 %. 

Taking both series together we have: 24,000 tosses: 11,973 heads, 12,027 
tails, or 49*8875% heads and 50*1125% tails. 

To avoid any chance of there being a slight loading in the coin—a very slight 
bias towards tails—let us call the heads, tails and tails, heads in the first 
12,000 tosses, we then find: 12,011 heads and 11,989 tails; or 50*046% heads and 
49*954% tails. 

Thus to 1/20 of a per cent heads and tails are equal, or to express it in another 
manner there has on the average been only one head too many in 1200 tosses. 
Billion’s experiments coincide with mine in showing a slight bias in favour of 
tail. 

Finally I have analysed the red and black events in an entire month’s play of 
the roulette tables at Monte Carlo. I find that out of 16,178 throws of the ball* 
8111 fell into a red number and 8067 into a black, or there were 50*14% red 
and 49*86% black. 

These experiments amply confirm the rougher statistical experience of man¬ 
kind as to the equality of chances in tossing, or drawing balls from bags, or 
playing roulette. It is on experience of this kind, on accurate statistical measure¬ 
ment, not on a priori reasoning or subjective opinion, that the data of probability 
are to be based. 

In my next lecture I shall deal more at length with the nature of the statistics 
by which we supplement our ignorance of what is about to happen. 


* The twenty-seventh figure, 0, was of course omitted to equalize chances. 
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MEDICAL STATISTICS FROM GRAUNT TO FARR 
By MAJOR GREENWOOD 
INTRODUCTION 

Under the Fitzpatrick Trust, a Fellow of the Royal College of Physicians of 
London is chosen annually by the President and Censors to deliver two lectures 
in the College on * The History of Medicine I had the honour of being chosen 
for this office in 1940 but, for obvious reasons, the lectures were not delivered, 
and it may be safely assumed that some years will pass before a medical audience 
will have time to attend to the history of a subject the modern practice of which 
does not make a strong appeal to physicians. 

The nature of the intended audience inclined me to stress the medical rather 
than the purely statistical aspects of the story and I have trodden ground over 
which a greater man passed some years ago. I hope that Karl Pearson’s studies 
of some or all of these old heroes will eventually be printed, and I know that my 
slight essays can ill sustain a comparison. But, precisely because they are 
slight and linger over small traits and human oddities, they may, in these times, 
wile away an hour or two. I have eliminated some explanations which no 
statistician or biometrician needs and the medical technicalities are few. Perhaps 
a note on the London College of Physicians as it was in the days to which these 
studies relate should be added. 

The College was more than a century old when John Graunt was born, and 
the corporation consisted wholly of physicians who were Doctors of Medicine of 
Oxford or Cambridge;- these were the Fellows. Physicians not Doctors of 
Medicine of Oxford or Cambridge were admissible only to the grade of Licentiate, 
and it was not until the nineteenth century, when Farr was a young man, that 
the exclusive privilege of the senior universities was abolished. It was not until 
Farr was a middle-aged man that the College had any direct contact with general 
practitioners of medicine and began to examine persons who did not seek to 
practise solely as physicians. In modern usage the College licence, L.R.C.P. 
(now only granted jointly with the membership of the Royal College of Surgeons, 
M.R.C.S.), is a diploma obtained by a large proportion of general medical 
practitioners in the South of England. Down to Farr’s time, the L.R.C.P. was a 
‘specialist’ diploma and could not have been taken by a general practitioner 
(the apothecary of those days) at all. The old L.R.C.P. is represented by the 
M.R.C.P. of our own time but with this distinction. Now, Fellows (F.R.C.P.) 
are normally chosen from the body of M.R.C.P.’s. In the past only Doctors of 
Medicine of Oxford or Cambridge could be Fellows, and before election but after 
examination were known as ‘candidates’, not licentiates. The great physician 
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Sydenham was,never more than a licentiate. He graduated M.B. at Oxford and, 
for some unknown reason, never proceeded M.D. until near the end of his life, 
when he took the higher degree not at Oxford but at Cambridge. 

I. THE LIVES OP PETTY AND GRAUNT 

It is always rash to assign an absolute beginning to any form of intellectual 
effort, to say that this or that man was the very first to fashion some organon 
which has proved valuable. All we are justified in saying is that this or that 
plan’s work can be shown to have so directly influenced the thought of his con¬ 
temporaries or successors that from his day the method he used has never been 
forgotten. It may be that the lost works of the school of the Empirics Galen 
despised anticipated the numerical method of Louis—some words of Celsus are 
consistent with the hypothesis. It may be that in the long succession of parish 
clerks who for more than a century transcribed the London Bills of Mortality, 
one or two suggested that these figures might have some other use than that of 
warning His Highness of the need to move into Clean Air. But we do not know. 
We do know that out of the casual intercourse of two Englishmen in the seven¬ 
teenth century was produced a method of scientific investigation which has 
never ceased to be applied and has influenced for good or ill the thought of all 
mankind. In that sense at lekst we may fairly hold that John Graunt and 
William Petty were the pioneers not only of medical.statistics and vital statistics 
but of the numerical method as applied to the phenomena of human society. 

John Graunt and William Petty were both of Hampshire stock. Petty was 
of Hampshire birth, born on Monday, 26 May 1623, and was three years younger 
than John Graunt, who was born at the Seven Stars in Birchin Lane on 24 April 
1620. 

Materials for writing Petty’s life are abundant; indeed a good biography of 
him was written nearly fifty years ago by his descendant Lord Edmond Fitz- 
maurice, and since then much of the material used by Lord Edmond has been 
printed. Sources for Graunt’s biography are scanty, the most valuable John 
Aubrey’s brief life of him.* Graunt and Petty became acquainted in or before 
1660. The circumstances of that first acquaintance are interesting to those who 
meditate upon the perepeteia of human fate. It was the contact of client and 
patron. 

John Graunt’s early life and manhood were those of the Industrious 
Apprentice. His father was a city tradesman, who bred his son to the profession 
of haberdasher of small wares, John ‘rose early in the morning to his study 
before shop-time \and learned Latin and French, but did not neglect his business. 
He was free of the Drapers’ Company and went through the city offices as far as 

* Brief Lives, chiefly of Contemporaries, set down by John Aubrey, between the years 1669 and 
1696, edited by Andrew Clark, Oxford, 1896, 1, 271 et seq. 
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common councilman; he was captain and then major of the trained bands (the 
ancestor of the Honourable Artillery Company). At the time of the Great Fire 
he is said to have been an opulent merchant. Even fifteen years earlier he—and 
no doubt his father (1692-1662)—had city influence. At that time a Gresham 
professorship was vacant and a young Dr Petty was anxious to obtain it. This 
young man’s career had been unlike that of an industrious apprentice; it had 
been, even for the seventeenth century, romantic. His father was a clothier in 
Romsey, who ‘ did dye his owne cloathes ’ in a small way of business. When 
William was a child, ‘his greatest delight was to be looking on the artificers— 
e.g. smyths, the watch-maker, carpenters, joyners etc.—and at twelve years old 
could have worked at any of these trades. Here he went to sohoole, and learnt 
by 12 yeares a competent smattering of Latin, and was entred into Greek’ 
(Aubrey, Clark’s edition, 2, 140). 

But the precocious lad did not find a patron in Romsey and was shipped for 
a cabin boy at the age of fourteen. His short sight earned him a taste of the 
rope’s end,' and after rather less than a year at sea he broke his leg and was set 
ashore in Caen to shift for himself. ‘Le petit matelot otaglois qui parle latin et 
grec’ attracted sympathy and obtained instruction in Caen. Caen was not a 
famous seat of learning like Leyden or Montpellier, but the Fellows and 
licentiates of the College of Physicians admitted between 1640 and 1700 include 
the names of four persons who studied or graduated in Caen (Nicholas Lamy, 
Theophilus Garencieres, John Peachi and Richard Griffiths). Petty, however, 
was not then thinking of medicine but mathematics and navigation and came 
home to join the navy. In what capacity he served is unknown; he merely says 
(in his Will) that his knowledge of arithmetic, geometry, astronomy conducing 
to navigation, etc., and his having been at the University of Caen, ‘preferred me 
to the King’s Navy where at the age of 20 years, I had gotten up about three 
score pounds, with as much mathematics as any of my age was known to have 
had’. His naval career was short, for in 1643 he was again on the continent. 
Here he wandered in the Netherlands and France and studied medicine or at 
least anatomy. He frequented the company of more eminent refugees, such as 
Pell and Hobbes, as well as that of the French mathematician Mersen. He was 
very poor and told Aubrey that he once lived for a week on three pennyworth of 
walnuts, but on his return to England the three score pounds had increased to 
seventy and he had also educated his brother Anthony. 

At first Petty seems to have tried to make a living out of his father’s business, 
but he soon went to London with a patented manifold letter writer and sundry 
other schemes of an educational character. These occupied him between 1643 
and 1649 and made him acquainted with various men of science, among others 
Wallis and Wilkins, but were not remunerative, and in 164# he migrated to 
Oxford. 

Petty was created Doctor of Medicine on 7 March 1649 by virtue of a 
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dispensation from the delegates (no doubt the parliamentary equivalent of the 
Royal Mandate of later and earlier times). He was also made a Fellow of 
Brasenose and had already been appointed deputy to the Professor of Anatomy. 
He was admitted a candidate of the College of Physicians in June 1650 (he was 
not elected a Fellow until 1655 and was admitted on 25 June 1658). At Oxford 
he became something of a popular hero by resuscitating (on 14 December 1651) 
an inefficiently hanged criminal, who, condemned for the murder of an illegiti¬ 
mate child, is said to have survived to be the mother of lawfully begotten 
offspring. 

Academically Petty rose to be full Professor of Anatomy and Vice-Principal 
of Brasenose. It is at this point (as usual the precise dates are dubious) that he 
became a candidate for a Gresham professorship and made contact with John 
Graunt. 

Although, as I have said, the materials for a biography of Petty are abundant, 
all we know of his early years comes from himself or from friends of later life 
who knew no more than he told them. We have no independent means of 
judging the extent of his culture. There is good evidence that he knew more 
Latin than most Fellows of the College of Physicians know now; none that he 
was an exact scholar (indeed we have his own word, which I am not prepared 
to gainsay,* to the contrary). He was certainly admitted to friendship by some 
men, such as Walhs and Pell, who were serious mathematicians, as by others, 
such as Hobbes, who were not. But whether he could fairly be called a mathe¬ 
matician is doubtful. Of his medical knowledge we know little.. He left medical 
manuscripts, but these are stid unpublished; of his clinical experience we know 
nothing. 

Petty told Aubrey that ‘he hath read but little, that is to say, not since 
25 aetat., and is of Mr. Hobbes his mind, that had he read much, as some men 
have, he had not known as much as he does, nor should have made such dis¬ 
coveries and improvements ’. But it is at least certain that he made a favourable 
impression upon men who had read a good deal and that the young Dr Petty of 
1660 was thought a promising man. Still it had been an odd career and one 
wonders what a steady business man in the city of London thought of it. 

Why the anatomy professor who had resuscitated half-hanged Ann Green 
should be made a professor of music is not obvious, and if the Gresham appoint¬ 
ments were jobs, why should the job be done for Petty ? The modern imaginative 
historian might suggest various reasons. For instance, that Petty made a 

* If No. 88 of The Petty Papers (2, 36) ia a typical example of Potty’s Latin Ptobc style, there 
is not much to be said for it. Here is an example:..‘An dulcius est humanae naturae permultos 
Buam potestatem in unum quondam et in perpetuum transferre, id est pendis amittere quam ipso 
puel deindem servare, vel paulatium et in breve tempus irogare, a seipsis demo refonnondam et 
disponendam alioquin pro ut, mutato tam rerum quam animi indies suaserit?’ Some of the 
gibberish may be due to the editor’s failure to decipher the handwriting, but no emendation could 
twist this into unbarbaric prose. 
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conquest of Graunt, perhaps had Hampshire friends who were friends of the 
Graunt family, perhaps talked about political arithmetic. We have no evidence 
at all. If the Gresham Professor of Music had duties, Petty did not perform them; 
about the time of Ms appointment he obtained leave of absence from Brasenose 
and within a year (in 1652) had left for Ireland, where he was to be very busy 
for some time to come and to make, or found, his material fortunes. 

Macaulay (chap, m) says that at the end of the Stuart period the greatest 
estates in the kingdom very little exceeded twenty thousand a year. 

The Duke of Ormond had twenty-two thousand a year. The Duke of Buckingham, 
before his extravagance had impaired his great property, had nineteen thousand six 
hundred a year. George Monk, Duke of Albemarle, who had been rewarded for his eminent 
services with immense grants of crown land, and who had been notorious both for covetous¬ 
ness and for parsimony, left fifteen thousand a year of real estate, and sixty thousand 
pounds in money, which probably yielded seven per cent. These three Dukes were supposed 
to be three of the very richest subjects in England. 

In 1686 Petty made his Will. This Will is a curiously interesting document, 
because it is also an autobiography. It is rich in arithmetical statements and, 
like much of Petty’s arithmetic, the statements may be optimistic. Petty’s final 
casting of his accounts is in this fashion: ‘Whereupon I say in gross, that my 
reall estate or income may be £6,500 per arm. my personall estate about £45,000, 
my bad and desparate debts, 30 thousand pounds, and the improvements may 
be £4000 per ann., in all £15,000 per arm. ut supra.’ 

The details of the calculation are perplexing enough; still if the above cited 
dukes were the richest subjects of the king and if (Macaulay) ‘the average income 
of a temporal peer was estimated by the best informed persons, at about three 
thousand a year ’, Sir William Petty, of the year 1685, had travelled as far from 
the young Oxford professor of 1650 as that budding physician from the little 
English cabin boy who spoke Latin and Greek, in Caen, in 1638. The details of 
the fortune-building are not our concern. The shortest account is Petty’s own in 
his Will. He says that by the end of his Oxford career be had a stock of four 
hundred pounds and received an advance of one hundred more on setting out 
for Ireland, 

Upon the tenth of September, 1652,1 landed att Waterford, in Ireland, Phisitian to the 
army, who had suppressed the Rebellion began in the year 1641, and to the Generali of the 
same, and the Head Quarters, at the rate of 20s. per diem, at which I continued, till June, 
1669, gaining by my practice about £400 per annum, above the said sailary. About 
September, 1654, I, perceiving that the admeasurement of the lands forfeited by the fore- 
mentioned Rebellion, and intended to regulate the satisfaction of the soldiers who had 
suppressed the same, was most insufficiently and absurdly managed, I obtained a contract, 
dated the 11th. of December, 1654, for making the said admeasurement, and by God’s 
blessing so performed the same as that I gained about nine thousand pounds thereby, which 
with the £600 above mentioned, my sallary of 20s. per diem, the benefit of my practice, 
together with £600 given me for directing an after survey of the adventrs lands, and £800 
more for 2 years sallary as Clerk of the Couneell, raised me an estate of about thirteen thou¬ 
sand pounds in ready and reall money, at a time, when, without art, interest, or authority, 
Biometrika xxxu 8 
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men bought, as much lands for 10s, in mail money as in this year, 1685, yield 10s. per atm. 
rent above his Maties quitt rents ( The Life of Sir William Petty, by Lord Edmond Fitz- 
maurice, London 1895, p. 319). 

No one would willingly rake over the embers of Irish history—still glowing 
after nearly three hundred years. Petty believed himself to be a good man 
struggling against adversity and a public benefactor treated with gross injustice 
to the day of his death. Lecky (History of Ireland, vol. 1, chap. 1, p. Ill of 
popular edition) took a less favourable view. Even if the subject were relevant. 
to my undertaking, which it is not, I have not the training in historical research 
to justify me in writing about it. There are, however, some points of psycho¬ 
logical interest. 

Petty did not, like his contemporary Thomas Sydenham, actually take up 
arms against the king, but he was even more plainly a protege of the king’s 
enemies. Sydenham’s military career was unimportant; there is no reason to 
believe that he ever exchanged a word with a member of the Cromwell family. 
Petty was the confidential adviser and close personal friend of Henry Cromwell; 
his services to the Com m onwealth authorities were the foundation of his fortune, 
Like many people who have social gifts he had the gentle art of making enemies. 

Pepys, Aubrey and Evelyn concur in the judgment that Petty was a most 
entertaining companion. Evelyn says he was a wonderful mimic. He could 
speak ‘now like a grave orthodox divine; then falling into the Presbyterian way; 
then to Panatical, to Quaker, to Monk, and to Friar and to Popish Priest’. The 
gift he exercised among his friends. 

My Lord D. of Ormond oirce obtained it of him, and was almost ravished with admira¬ 
tion; but by and by he fell upon a serious reprimand of the faults and miscarriages of some 
Princes and Governors, which, though he named none, did so sensibly touch the Duke, 
who was then Lieutenant of Ireland, that ho began to be very uneasy, and wished the spirit 
layed, which he had raised; for he was neither able to endure such truths, nor could hut be 
delighted. At last he turned his discourse to a ridiculous subject, and come down from the 
joint-stool on which he had stood, but my lord would not have him preach any more 
(Evelyn). 

My lord Duke was not the first or last person to fail to relish a joke against 
himself. 

In The Londoners a challenged party names garden hoes as the weapons, 
That was Mr Robert Hichens’s fun, In real life, Petty, challenged to mortal 
combat by a Cromwellian soldier, pleaded his myopia and demanded that the 
duel should take place in a cellar and the weapons be axes. 

A man like this makes friends or at least admirers, also enemies. Long before 
the king enjoyed his own again, Petty had a host of enemies. When the king 
returned, one might have expected that Petty’s position would be critical. 
According to his own account h e-did lose something, but he was knighted and the 
losses, such as they were, did not seem to stay the growth of his fortune. At the 
Restoration he was already prosperous and he died wealthy; Perhaps the 
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explanation is that Petty was really as great a public benefactor as he thought 
he was. Perhaps the reason is personal. King Charles loved wits (in the old and 
new sense of the word) and Petty was a wit. The scanty specimens of what 
Petty’s modern representative calls ‘Rabelaisian 1 printed from the Petty papers 
would not have appealed to such a connoisseur in this genre as the king—we 
know from Halifax that the king liked to be the raconteur in this field and indeed 
repeated himself often—but he would have relished a good mimic. Still more 
important might have been their common virtuosity. 

Charles was interested in experimental science, and although Petty certainly 
knew more than the king, he may not have known very much more. Neither 
Charles nor James would have been able to find more common ground with 
Isaao Newton than in a later age Bonaparte found with Laplace. But the 
ingenious Dr Petty, who had resuscitated half-hanged Ann Green (which would 
be a capital story if well told), invented an unsinkable ship, had a dozen plans 
for doubling the king’s revenue, and knew something of everything, probably 
did more than Wilkins to interest the king in the new society of virtuosos (how 
the long must have relished the story of the planting of horns in Goa*), and he 
may incidentally have interested the king in his business affairs. This is all 
speculation; what is sure is that when Petty was back in London and able to 
renew personal intercourse with John Graunt, their relation was no longer that 
of client and patron. Por a few years more, Graunt was to be a solid merchant, 
but before long Petty was the patron and Graunt the client. 

At this point it will be convenient to conclude the biographical facts relating 
to Graunt. I take them mainly from Aubrey. 

Graunt continued to be a prosperous city tradesman for many years after 
his first meeting with Petty. ‘He was ’, says Aubrey, ‘a man generally beloved; 
a faithful friend. Often chosen for his prudence and justice to be an arbitrator; 
and he was a great peace-maker. He had an excellent working head, and was 
facetious and fluent in his conversation.’ Pepys thought as well of Graunt as did 
Aubrey, admiring both his conversation and his collection of prints—‘the best 
collection of anything almost that ever I saw 5 . 

Prom the Restoration for several years Graunt figures in London intellectual 
society (he was elected P.R.S. in 1663), but a material calamity was at hand. 
The Pire of 1666 no doubt caused Graunt direct financial loss; this might have 
been repaired. But, although brought up in Puritan ways, ‘he fell’, to quote 
Aubrey, ‘to buying and reading of the best Socinian bookes, and for severall 

* Sir Philiberto Vematti, Resident in Batavia, had certain inquiries sent him. by order of the 
Royal Sooioty. The eighth question was: ‘What ground there may be for that Relation, concerning 
Horns taking Toot, and growing about Goa? ’ This is Sir Philiberto’s answer: ‘Inquiring about this, 
a friend laughed, and told me it was a jeer put upon the Portuguese, because the women of Goa are 
counted much given to leohery’ (Sprat’s History of the Boyal Society of London, 2nd ed. London 
1702, p. 161). 
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jars continued of that opinion. At least, about. . .he turned a Roman Catho- 
que, of which religion he dyed a great zealot.’ 

Graunt’s path to Rome was similar to that of young Edmund Gibbon, but 
le results on the career of a city tradesman in the days of Oates triumphans 
ere more serious than a visit to Lausanne. Graunt became bankrupt. Hie 
ame dropped out of the list of the Royal Society after 1666, and in 1674 he 
ied. There is evidence that in these last years of worldly misfortune, when the 
heel had come full circle since Graunt had secured the Gresham professorship 
>r Petty, Petty helped Graunt. When Petty was in Ireland, Graunt acted in 
>me sort as his London agent, and Petty conceived a plan of settling Graunt in 
•eland. But (we have, of course, only Petty’s word for this) Graunt was not an 
isy man to help; it is possible, of course, that he may have resented Petty’s 
dmonitions. ' You have done amiss in sundry particulars, which I need not 
lention because you yourself may easily conjecture my meanings. However we 
lave these things to God and be mindful of what is the sum of all religion, and 
f what is and ever was true religion all the world over.’ This is an extract from 
letter of January 1673 to Graunt (The Petty-Southwell Correspondence , p. xxix) 
rinted by the late Marquis of Lansdowne. If Lord Lansdowne was right (the 
dole letter is not printed) in thinking this a reference to Graunt’s conversion 
or perversion) ‘of which’, says Lord Lansdowne, ‘Petty seems to have dis- 
pproved on temporal rather than spiritual grounds’, it might have hurt a 
ensitive man. 

Graunt died on Easter Eve 1674 and was buried the Wednesday following in 
!t Dunstan’s church in Pleet Street. ‘A great number of ingeniose persons 
ftended him to his grave. Among others, with teares, was that ingeniose great 
drtuoso, Sir William Petty, his old and intimate acquaintance, who was sometime 
i> student of Brasenose College.’ Sir William outlived his friend thirteen years 
md lies in Romsey Abbey. Until a descendant in the nineteenth century (the 
ihird Marquis of Lansdowne) erected a monument, ‘not even an inscription 
ndicated that the founder of political economy lay in Rumsey Abbey’ (Fitz- 
naurice, p. 315). 

Graunt had a son who died in Persia and a daughter who, according to 
Aubrey, became a nun at Ghent. Nothing is known of descendants. 

Petty’s widow was raised to the peerage and her elder sons, Charles and 
Eenry, died without issue. But the title was revived in favour of the grandson 
of John Fitzmaurice, the second surviving son of Thomas Fitzmaurice, Earl of 
Kerry, who, as the above-mentioned grandson remarked, had ‘married luckily 
for me and mine, a very ugly woman who brought into his family whatever degree 
of sense may have appeared in it, or whatever wealth is likely to remain in it’. 
This ill-favoured woman was Petty’s daughter Anne, to whom her father wrote: 

My pretty little Pusling and my daughter Ann 
That shall bee a countesse, if her pappa can. 
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The cynical grandson was George Ill’s prime minister and afterwards his bite 
noire, ‘The Jesuit of Berkley Square’ and first Marquis of Lansdowne. 

Of the two friends, one has left an intellectual monument only; descendants 
of the other have been famous in English history. 

Of these, best known are the first and third Marquises of Lansdowne, 
William (1737-1805) and Henry (1780-1863). Of the first marquis, much better 
known as Lord Shelburne (the title created for Lady Petty), every schoolboy— 
not only Macaulay’s schoolboy—has heard; the quarrel between Charles Eox 
and Shelburne, the party split, the coalition ministry and so on. Schoolboys who 
have reached the sixth and Leeky’s History of England in the Eighteenth Century, 
know a little more. Shelburne, who had much more than a tincture of his 
great-grandfather’s ability and applied himself to economic studies, was one of 
the earliest to appreciate the importance of Adam Smith and was highly thought 
of by two good. judges of scientific ability, Benjamin Franklin and Jeremy 
Bentham. 

As a public man, no parliamentary statesman before or since obtained so 
universal a dislike, a positive hatred shared by those who knew him and those 
who did not. 

There is certainly nothing in the actions of Shelburne to justify this extreme un¬ 
popularity. Much of it was, I believe, simply due to an artificial, overstrained, and affectedly 
obsequious manner, but much also to certain faults of character, which it is not difficult to 
detect. Most of the portraits that were drawn of him concur in representing him as a harsh, 
cynical, and sarcastic judge of the motives of others; extremely suspicious; jealous and 
reserved in his dealings with his colleagues; accustomed to pursue tenaciously ends of his 
own, which he did not frankly communicate, and frequently passing from a language of 
great superciliousness and arrogance to a strain of profuse flattery (Lecky, 5, 138). 

How far some of these characteristics may be recognized in Shelburne’s 
ancestor, we shall inquire in due course. 

The contrast between Malagrida* and his son Henry is shattering. It is this 
Marquis of Lansdowne of whom nearly everybody thinks when he sees the title 
in a book, and rightly so. Walter Bagehot wrote: 

You may observe that when an ancient liberal, Lord John Russell, or any of the 
essential sect, has done anything very queer, the last thing you would imagine anybody 
would dream of doing, and is attacked for it, he always answers boldly, ‘Lord Lansdowne 
said I might' ; or if it is a ponderous day, the eloquence runs, ‘A noble friend with whom I 
have had the inestimable advantage of being associated from the commencement (the 
infantile period I might say) of my political life, and to whose advice,’ etc,, etc., etc.—and a 
very cheerful existence it must be for ‘my noble friend’ to be expected to justify—(for they 
never say it exoept they have done something very odd)—and dignify every aberration. 
Still it must be-a beautiful feeling to have a man like Lord John, to have a stiff, small man 

* Malagrida was an Italian Jesuit settled in Portugal who was burned in 1761. The supposed 
jesuitical propensities of Shelburne led to the name becoming his popular title. Hence Goldsmith’s 
unintended mot: 1 Do you know that I never oould conceive the reason why they call you Malagrida, 
for Malagrida was a very good sort of man.’ 
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bowing down before you. And a good judge (Sydney Smith) certainly suggested the con¬ 
ferring of this authority. 1 VVhy do they not talk over the virtues and excellencies of 
Lansdowne? There is no man. who performs the duties of life better, or fills a high station 
in a more becoming manner. He is full of knowledge, and eager for its acquisition. His 
remarkable politeness is the result of good nature, regulated by good sense. He looks for 
talents and qualities among all ranks of men, and adds them to his stock of society, as a 
botanist does his plants; and while other aristocrats are yawning among stars and garters, 
Lansdowne is refreshing his soul with the fancy and genius which he has found in odd places, 
and gathered to the marbles and pictures of his palace. Then he is an honest politician, a 
wise statesman, and has a philosophic mind etc., etc. Here is devotion for a carping critic; 
and who ever heard before of bonhomie in an idol? (Bagehot, Works, 2, 64-5). 

Of the father, Atticus (an alias of ‘Junius’) wrote: 

The Earl of Shelburne had initiated himself in business by carrying messages between 
the Earl of Bute and Mr. Fox, and was for some time a favourite with both. Before he was 
an ensign he thought himself fit to be a general, and to be a leading minister before he ever 
saw a public office. The life of this young man is a satire on mankind. The treachery which 
deserts a friend, might be a virtue compared to the fawning baseness which attachos itself 
to a declared enemy (Letters of Junius, Wade’s edition, 2, 248). 

Naturally justice was no more to be expected in eighteenth-century news¬ 
paper diatribes than in the twentieth century, but a clever caricaturist does not 
represent Charles Fox as a living skeleton. Those who attacked the son—there 
were such people—took a different line, as Bagehot hints. Perhaps even in his 
very different character something of the ancestral Petty survives. We shall try 
to discover what this was. 

Forty years ago Hull brought out an edition of Petty’s tracts in which he 
included Graunt’s work. In 1927 the fifth Marquis of Lansdowne printed a 
selection from the Petty papers and in 1928 the correspondence between Petty 
and his wife’s cousin,* Sir Robert Southwell [The Petty-Southwell Correspondence, 
edited by the Marquis of Lansdowne, London 1928). 

We shall have to examine in detail both the ‘works’ and the ‘papers’, but, 
as a light upon the character of Petty, the Southwell correspondence is the 
strongest we have. Southwell himself was some generations farther away from 
adventuring than Petty. He came of an ‘undertaker’ stock—the adventurers in 
Ireland of Queen Elizabeth’s time—and his father was vice-admiral of Munster 
before him. He was born in 1635 (died in 1702), regularly educated (Queen’s 
College, Oxford and Lincoln’s Inn), knighted in 1665, for some time Clerk of the 
Privy Council, in the diplomatic service, held other offices, was a member of 
parliament and eventually settled in a country house near Bath. He was 
President of the Royal Society 1690-5. He might be described as a lesser 
William Temple; better educated and less selfish, not so able, but with the same 
cool, cautious judgment; a psychological antithesis of his correspondent. 

* Petty married in 1667 Lady Fenton, widow of Sir Maurice Fenton and daughter of Sir 
Hardress Waller who, knighted in 1629, fought for the Parliament and was one of the King’s 
judges; he was a major general in Ireland in 1650-1 and a patron of Petty there. 
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The correspondence covers the eleven years 1676-87. Both men were, even 
by modern standards, middle aged. They write one to another with complete 
frankness; there is a remarkable absence of the elaborate verbal formalities 
which in seventeenth-century and even eighteenth-century letters are so 
wearisome. 

Petty’s side of the correspondence consists roughly of domesticities 10 parts, 
eager accounts of his quarrels and law suits concerning money 40 parts, discussion 
of papers or projected papers 40 parts, add autobiographical boasting to make 
up the 100. 

In the purely domestic part of the correspondence, Petty is seen as a kind, 
good-natured father interested in the doings of his relations by marriage, also 
as a very bad judge of others’ feelings. I remember to have read an unpublished 
letter by the famous Edwin Chadwick, the great and very unpopular sanitarian 
of a century ago. It was written to a friend whose wife had just died of puerperal 
fever. Chadwick expressed regret in the shortest possible formula and assured 
his correspondent that the best solace he could have would be to assist in 
pushing forward a bill (which I think he enclosed) to promote some sanitary 
reform which would have the effect of making it less likely that other men would 
lose their wives in childbed. I remember thinking that, however sensible the 
recommendation, the man who gave it was not likely to bring much comfort to 
his friend. 

Petty was very much like Chadwick here. Southwell lost his wife in 1681 
and Petty condoled with him as follows: 

When your good father dyed, I told you that hee was full of years and ripe fruit, and 
that you had no reason to wish him longer in the paines of this world. But I cannot use 
the same Argument in this Case for your Lady is taken away somewhat within half the 
ordinary age of Man and soon after you have been perfectly married to her; for I cannot 
believe your perfect union and assimulacon was made till many years after the Ceremonies 
at Kinsington. 

What I have hitherto said tends to aggravate rather than mitigate your sorrow. But 
as the sun shining strongly upon burning Coles doth quench them, so perhaps the sadder 
Sentiments that I beget in you may extinguish those which now afflict you. The next Thing 
I shall say is, That when I myself married, I was scarce a year younger then you are now, 
and consequently do apprehend That you have a second Crop of Contentment and as much 
yet to come as ever I have had. 

This remark, curiously enough, was not well received. 

You doe not onely condole the great loss I have sustained in a wife, but you seeme to 
think it reparable... . But when by 19 yeares conversation I knew the greats vertues of her 
mind, and discover since her death a more secrett correspondence with Heaven in Acts of 
Pietye and devotion (which before I knew not of), you will allow me, at least for my 
Children’s sake, to lament that they have too early lost their guide. 

Petty could not, it seems, understand that Southwell was wounded and 
returned to the charge in a letter which is lost. That letter provoked a reply 
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which even Petty could not misunderstand and elicited an apology {Correspon¬ 
dence, p, 90). 

Petty was quite incorrigible. A few years later Southwell had another family 
bereavement and is condoled with in the following terms: 

That by the death of your Father, Mother and Sister, of Sir Edward Deering and your 
three nephews, you are the Head and Governor of both Familyes. That by the death of 
Rupe, Ingenious Neddy culminates; and by that of your Excellent Lady you are entitled to 
that million I mentioned of unmarryed teeming Lad yes. 

Once again, Southwell was not comforted. ‘Cousin, you doe wipe off Teares 
at a very strange rate, but why did nature furnish Them if there must be no 
Sorrow 1 ’ 

Petty had a very quick perception of when and where Ms shoe pinched, but 
no imaginative sympathy. 

Passing to Petty’s financial affairs and lawsuits, the position was this. By 
original grants, by purchase and in various ways, Petty had widely scattered 
Irish interests. Questions of the validity of the original grants, of rent charges 
due to the crown or to other grantees, of matters of fact and matters of law were 
endless. Petty saw himself steadily as a great public benefactor harassed by 
scoundrels, and it never occurred to him even as a theoretical possibility that 
others had rights. Of his manner of proceeding the editor of the correspondence 
gives a typical example (Correspondence, p. 90). In 1681 Petty gave evidence 
before Lord Chief Baron Hen as to ‘Soldier’s land’ which he had bought in 
Kerry and, it seems, the court decided against him. 

Petty gave vent to his chagrin in a long and scurrilous lampoon against the offending 
judge, entitled: ‘ henealogie or the legend of Hen-Hene and Pen-Hene’, in two parts. 
Whereof the first doth in 24 chapters of Raillery, contain the enchantoments, metamor¬ 
phoses and merry conceits relating to them. The second part contayning (in good earnest) 
the foolish, erroneous, absurd, malicious and ridiculous ‘judgements of hen-hene’. 
Fortunately perhaps for tho repute of its author, this diatribe was never made public. 

Fortunately, also, for a more material reason; it would probably have led 
to a second incarceration for contempt of court. 

Southwell evidently viewed his good cousin’s proceedings with a mixture of 
gentlemanlike annoyance and practical minded contempt. He expressed these 
feelings more than once; the following extract from a letter of 1677 is typical; 
the particular suit in progress to which reference is made was a claim for £6000 
in respect of a sum of £2500 actually advanced by Petty to the Farmers of 
Revenue. 

And suffer from me this expostulation, who wish your prosperity as much as any man 
living; and having opportunities to see and heare what the temper of the world is towards 
you, I cannot but wish you well in Port, or rather upon the firm Land, and to have very 
little or nothing at all left to the mercy and good will of others. For there is generally 
imbibed such an opinion and dread of your superiority and reach over other men in the 
wayes of dealing, that they hate what they feare, anil find wayes to make him feare that is 
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fearcl. I doe the more freely open, my soul to you in this matter, because tis not for the 
vitells that you contend, hut for outward Limbs and accessions, without which you can 
subsist with Plenty and Honour. And therefore to throw what you have quite away, or at 
least to put it in dayly hazard onely to make it a little more than it is, Is what you would 
condemne a thousand times over in another. And you would not think the Reply sufficient 
that there was plain Right in the Cause and Justice of their side, for iniquities will abound 
and the world will never be reformed. 

After all this is said, I mean not that you should relinquish the pursute of your 2500£, 
which is money out of your Pockett and for which you are a Debtor unto your Family. 
But for other pretensions, lett them goe for Heaven’s Sake, as you would a hott coale out 
of your hand: and strive to retire to your home in this Place, where you had the respect of 
all, and as much quiet as could be in this life, before your medling with that pernicious 
business of the Farme. 

There is no reason to suppose that Petty ever took such sensible advice. Yet, 
somehow, he kept his head well above water. 

In the later part of the correspondence Petty indulges in that complacent 
financial retrospect which he inserted in his Will and I have, perhaps too harshly, 
described as autobiographical boasting. It is possible that Southwell had heard 
of these financial triumphs rather often; at least there is a hint of this in the 
following: 

I will onely note that since you are soe Indulgent aa to think me worthy of being your 
Depositary in this great Audit, and expect by the Course of Nature that I should speake 
when you are Silent, you must allow me liberty without blame to aske questions when you 
seeme defitient or Redundant. 

That you are defitient may be suggested when, on the fortunate syde, I find noe Item 
for my Lady or of the hopefull stock she has brought you (p. 227). 

The shrewd thrust of the last sentence was deadly. The subject does not 
recur. 

I have indicated the character of the non-scientific part of the correspondence 
because we must examine Petty’s scientific writings in greater detail. I think, 
however, we have enough to justify a provisional diagnosis of Petty’s psycho¬ 
logical type. 

In literature and in life the perennial boy is often encountered. But while 
Peter Pan and Mr Reginald Fortune make far more friends than foes, that is not 
so true of their living counterparts. The exuberant flow of ideas and schemes, the 
intense and restless interest in everything which is characteristic of the clever 
child, often is extraordinarily attractive when it is associated with and con¬ 
trolled by the trained intelligence of a man. But the bad as well as the good 
points of a childlike or adolescent soul* are to be brought into the account. The 

* The first Marquis of Halifax said of King Charles that 1 his inclinations to love were the effects 
of health and a good constitution,' with as little mixture of the seraphic part as ever man had ’, and 
Petty held that the King was typical. In The Petty Papers (no. 93 of vol. 2) there is a memorandum 
headed ‘ Californian Marriages with the Reasons thereof’. ‘ In California ’, says Petty, ‘ 6 men were 
conjugerted to 6 women in order to beget many and well conditioned children, and for the greatest 
venereali pleasure, in manner following, viz.’ 

He then sets out the plan. One man ‘excelling in strength, nimbleness, beauty, wit, courage 
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clever child is often naively and intensely selfish, and so remains as the eternal 
hoy; his quite crude and unashamed egoism, his inability to understand that 
others have feelings and even rights, repel as strongly as his intellectual freshness 
attracts. How far he is a success in life depends on which way the balance 
turns. 

Petty seems to me a good example of this psychological type; its good points, 
the restless energy and exuberant flow of ideas, were sources of strength in such 
a time as that of the Civil War and Restoration, which, particularly the Restora¬ 
tion period, was in virtues and vices an age of grown-up children. Indeed his 
emotional adolescence may have shielded him from the deadly enmity of real 
men. Its bad points made him enemies, but they were children like himself. 
Nearly a century later, in a time of adults, these same characteristics, restless 
intellectual energy and vanity, exhibited by one no longer a rollicking adventurer 
but a great landowner, produced an unfavourable balance and we have ‘Mala- 
grida’. In Malagrida’s son, one has a change; the attractive traits, the eager 
interest in all sorts of things is still there, but the childish hungry vanity has 
been softened or sublimed. The cynic may say that it was easy for a great Whig 
lord 150 years ago to be agreeable, to keep himself hors concours ; perhaps it was, 
although the Dropmore Papers raise doubts. The fact, however, is certain. In the 
third Lord Lansdowne one sees the good and in the first the bad effects of the 
perennial boyishness of the ancestor. The ancestor lived in a state of society 
where the good points out weighted the bad points. That is why, although he 
made enemies and was often vexed, he was able to view his career with com¬ 
placency and to bequeath a great fortune. But it is not Petty as a man but 
Petty as a scientific worker who is the proper object of my study. 

How far does the psychological make-up which, as I think, characterized 
Petty conduce to scientific investigation? We might expect that it would be an 
immense stimulus to pioneering, that such a man would direct attention to a 
number of problems which deserved study, but that it would not lead to the 
production of any solid contribution to knowledge. Our task is to examine in 
some detail Petty’s scientific work. 

and good sense’ subsequently called the Hero, is allowed four women for his sole use. One Great 
Rich Woman is allowed five men who are to serve her when she pleases, but another woman is 
allotted to the fivo men for use in common by the five. 

It may he said this fable is only an after dinner jest—perhaps that is the whole explanation. 
But Petty doos go to the trouble of financial calculations, and does seem to suggest a serious con¬ 
sideration, (‘The encrease of children will he great and good.’ ‘No controversy about joynture, 
dower, maintenance, portion etc.’) Nobody emotionally adult would be likely to make Californian 
Marriages a basis for practical statecraft. 
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II. PETTY’S SCIENTIFIC WORK 

It is no part of my undertaking to survey the whole of Petty’s scientific 
activities, but to speak only of his medical and vital statistical work. 

In Hull’s edition of Petty’s writings, the editor discusses Petty’s status as an 
economist and remarks that Petty’s view that value depended upon labour was 
probably derived from Hobbes. The corn rent of agricultural lands was in Petty’s 
view determined by the excess of their produce over the expenses of cultivation, 
paid in corn, and the money value of the excess will be measured by the amount 
of silver which a miner, working for the same time as the cultivator of the corn 
land, will have left after meeting his expenses with a part of the silver he secures 
(Hull, p. lxxiii). Why there should be any surplus, he explains by density of 
population. 

Prof. Hull refrained from attempting to assess Petty’s work in terms of 
modern economic theory. A mere medical statistician will naturally follow this 
example. More than a century ago, Mr Chainmail had learned from Mr MacQuedy 
that the essence of a safe and economical currency was an interminable series of 
broken promises and added: ‘There seems to be a difference among the learned 
as to the way in which the promises ought to be broken; but I am not deep 
enough in their casuistry to enter into such nice distinctions.’ Medical statisti¬ 
cians may well adopt Mr Chainmail’s modest attitude towards the whole field 
of economic theory. Confining ourselves to statistics, we must consider what 
Petty thought should be done and what he actually did himself. 

Under the first heading, praise can be unstinted. More than 150 years before 
the establishment of the General Register Office, Petty specifically proposed the 
organization of a central statistical department the scope of which was wider 
than that of our existing General Register Office. It was to deal not only with 
births, marriages, burials, houses, the ages, sexes and occupations of the people, 
but with statistics of revenue, education and trade (see The Petty Papers, 1, 
171-2). He did not confine himself to vague recommendations, but drew up an 
enumeration schedule to be used for each parish. On this was to be entered: 
The number of housekeepers and of houses; the number of hearths; the number 
of statute acres; the number of people by sex and in age groups, viz. under 10, 
between 10 and 70, over 70; for males those aged 16 to 60, and for females those 
between 16 and 48 and how many of these latter were married; how many 
persons were incurable impotents and how many lived upon alms. This, it will 
be noted, is a better enumeration schedule than any used in England before the 
census of 1821. Further in his notes (printed in The Petty Papers) are various 
suggestions for the utilization of data collected in this way. 

The most striking is this: ‘The numbers of people that are of every yeare old 
from one to 100, and the number of them that dye at every such yeare’s age, 
do shew to how many yeare’s value the life of any person of any age is equivalent 
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and consequently makes a Par between the value of Estates for life and for 
years’ (The Petty Papers, 1, 193). 

This is, I think, the most remarkable thing Petty ever wrote, for it suggests 
that he had grasped the principle of an accurate life table, viz. a survivorship 
table based upon a knowledge of rates or mortality in age groups. No such table 
was constructed from population data until the end of the eighteenth century, 
because until then data of the age distribution of the living population were not 
obtained. Whether Petty also realized that under certain conditions a life table 
could be constructed without knowledge of the ages of the living population is 
a controversial matter which I shall discuss later on. 

Then he makes suggestions which are relevant enough to modem demo¬ 
graphic problems. 

By the proportion between, marriages and births, and of mothers to births, may be 
learnt what hindrance abortions and long suckling of children is to the speedier propagation 
of mankind; as also the difference of soyles and ayres to this foecundity of women. 

By the proportion between maryd and unmaryd teeming women, may be found in what 
number of yeeres the present stock of people may bee encreased to any number assigned 
answerable to the defect of the peopling of the nation for strength or trade. 

There are not wanting some suggestions which imply that even if Petty’s 
opinion of the Faculty were higher than that of Sydenham (whom we honoured 
posthumously ) it was tinged with scepticism. 

Whether they [viz. fellows and licentiates of the College of Physicians] take as much 
medicine and remedies as tho like number of any other society. 

Whether of 1000 patients to the best physicians, aged of any decade, there do not die 
as many as out of the inhabitants of places where there dwell no physicians. 

Whether of 100 sick of acute diseases who use physicians, as many die and in misery, 
as where no art is used, or only chance. (The Petty Papers, 2, 169-70.) 

This statistical experiment has not yet been performed and indeed might be 
hardly so conclusive as Petty implied. 

When one passes from what Petty, suggested to what he actually did himself, 
our praise must be qualified. As Prof. Hull said, he was ‘more than once misled 
into fancying that his conclusions were accurate because their form was 
definite’. 

In judging Petty it is but fair to contrast him with College contemporaries 
whose names are more honoured by us. Among his contemporaries in the 
College were Thomas Browne and Thomas Sydenham. Browne was a much 
older man than Petty, Sydenham almost his coeval. Of Browne’s quality as a 
physician we know nothing; but his literary influence indirectly—through 
Samuel Johnson—and directly upon generations of readers has been greater than 
that of any other practising medical man. Browne, like Petty, had an enormous 
range of interests and his book learning was greater. But, as we shall see, when 
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he tackles a problem of demography, Petty’s rashest guesses seem by com¬ 
parison as soberly scientific as an aimual report of the Registrar-General. 

Sydenham was an iconoclast in clinical practice and believed himself to be 
emancipated from the rule of ancient authority. No fantastic arithmetical 
calculations are to be found in his writings. In fact, with a single exception 
(Observations Medicae, 2, i), no arithmetic at all. It never seems to have entered 
his mind, although his greatest work purports to give the history of the diseases 
in London through a generation, that the arithmetical statements of the London 
Bills of Mortality were of any value whatever. 

Sydenham was too wise a man for us to think that he rejected the evidence 
because the data were compiled by illiterate old women. He would have known 
that the sworn searchers had the loquacity of their sex and rank and were likely 
to ask what ‘the doctor said 5 . He rejected it, because counting and measuring 
things did not come within his purview, just as the first beginnings of pathology 
and medical chemistry seemed, to him irrelevant. 

For the most part, Petty’s statistical work was severely practical, but there 
is one excursion into theory which is interesting. It is to be found in a section of 
his tract on the use of what he calls Duplicate Proportion and is reprinted by 
Hull (pp. 622-3). 

Petty states that there are more persons living between the ages of 16 and 
26 than in any other decade of life. The statement is not true for modern 
populations and was probably not true for the English population of Petty’s 
time. In 1861-71 (before the fall in the birth rate and infant mortality rate) 
there were 6-4 millions living under 10, and 4-0 between 16 and 25), But perhaps 
Petty meant that there were more living in the decade 16 to 26 than in any later 
decade, in which case his statement was of course right unless the birth rate was 
falling. 

He then asserts that the 

Roots of every number of Men’s Ages under 16 (whose Root is 4) compared with the 
said number 4, doth show the proportion of the likelyhood of such men reaching 70 years 
of Age. As for example: ‘Tis 4 times more likely that one of 10 years old should live to 70, 
than a new born Babe. ‘Tis three times more likely, that one of 9 years old should attain 
the age of 70, than the said infant. Moreover, ’tis twice as likely, that one of 16 should reach 
that Age, as that one of four years old should do it; and one third more likely, than for 
one of nine. 

We have no life table for England in 1674. Perhaps the nearest modern 
experience might be the Liverpool Table calculated by Farr seventy years ago. 
According to that table the chance of a new-born child living to be 66 was 
0-0976 and the chance of a person of 15 living to 65 was 0-202, which is about 
double the infants chance, not four,times as large. For the Healthy Districts, 
the chances are 0-4246 and 0-54585; that is, in a ratio of 1-28 to 1. 

Petty’s statements are wildly wrong. The interesting point is how did he 
reach them? The only figures he had were printed by Graunt. 
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This ‘Life Table* gives l x as follows: 

l„ 100 

1 . m 

bo 40 

be 25 

La 10 

Now if we take 2 as the survivors to 70 (it does not of course matter what the 
numerator is for comparative purposes), then the infant’s chance of surviving 
to 70 is 0*02 and the person of 16 has the chance 1/20 = 0-05, a ratio of 2-5, not 
wildly different from the Liverpool Table figure and very different from 4-0. 

A fortiori when Petty, having passed above age 16, asserts that ‘it is five to 
four, that one of 26 years old will die before one of 16; and 6 to 6 that one of 36 
will die before one of 26 ’, we are in a region of pure fantasy because, even if he 
had had the statistical data, Petty would not have had the technical knowledge 
to solve the problem involved, viz. to find the probability that of two lives aged 
respectively x and y, the former will fall before the latter. 

If we keep within the range of the simple arithmetic which Petty used, the 
result cannot be obtained. 

He then passes to this statement: 

To provo all which I can produce tho aocompts of every Man, Woman, and Child, 
within a oertain Parish of above 330 Souls; all which particular Ages being cast up, and 
added together, and the Sum divided by the whole number of Souls, made the Quotient 
between 15 and 16; which I call (if it be Constant or Uniform) the Age of that Parish, or 
Numerua Index of Longaevity there. Many of whioh Indexes for several times and places, 
would make a useful Scale of Salubrity for those places, and a better Judg of Ayers than 
tho conjectural Notions we commonly read and talk of. And such a Scale the King might 
as easily make for all his Dominions, as I did for this one Parish. 

The puzzle is to discover why Petty thought this statistical experiment 
proved his point and why he regarded the mean age of the population of a parish 
its index of longevity. The first question I cannot answer at all; about the second 
I can make a guess. If the parish population were supported solely by births and 
there was no migration, then, if the death rates at ages did not vary, the popula¬ 
tion would be a stationary population and both the mean age of the living and 
the mean age at death would be constant. The expectation of life is greater than 
the mean age of the living unless the rates of mortality at early ages are very 
high and the more favourable the rates of mortality the greater will be the 
difference. In Petty’s day, when mortality at early ages was very high, the two 
constants were probably not far apart, but it is certain that both expectation of 
life and mean age of a life table population were greater than 16; probably of 
order 28 to 32. 

I think we may be sure that the parish Petty counted was not stationary in 
the statistical sense, but had an excess of births over deaths, and that his average 
threw no light upon the rates of mortality. 


bo 10 

be 0 

be 3 

bo 1 
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Passing to practical statistics, it will be convenient first to note rapidly 
statistical observations which are incidental in treatises of primarily financial or 
economic interest. In the Verbum sapienti, which although not printed until 
1691 was written as early as 1665, Petty attempts to reckon what a man is 
worth. Here is the method. He concludes from financial data that the annual 
proceed of the Stock or Wealth of the nation yields 15 millions, but that the 
expenses of the nation are 40 .millions. So the balance of 25 millions must be 
derived from the labour of the people. He assumes that the population is 6 
millions and that half of these can work, and earn £8. 6s. 8 d. a head per 
annum. This would be Id. a day, abating 52 Sundays and half as many other days 
for sickness, holidays, etc. ‘Whereas the Stock of Kingdom, yielding but 15 
Millions of proceed, is worth 250 Millions; then the People who yield 25, are 
worth 416 2/3 Millions. For although the Individuums of Mankind be reckoned 
at about 8 years purchase; the Species of them is worth as many as Land, being 
in its nature as perpetual, for ought we know.’ 

Why an individual’s working life is worth only 8 years’ purchase is not clear. 
One would be inclined to put it as the average number of years lived in the 
working period of life. Perhaps Petty took Graunt’s table and worked out the 
average number of years of life lived between the ages of 16 and 56; it is 
nearly 8. 

He then calculates the money loss due to 100,000 dying of the plague and 
makes it nearly 7 millions, adding that £70,000 would have been well disposed 
in preventing this ‘centuple loss’. Perhaps this is the first printed statement 
of the neglected truth that public health measures pay. 

Since Petty’s day, others, including Farr himself, have done sums of this 
kind; it is a popular occupation in the United States of America. 

Farr went to work more elaborately, making out a balance sheet of a man 
from the cradle to the grave. But the principle was much the same. We cannot 
say it is a wholly useless pastime. There is of course the difficulty that if more 
lives are saved the price of labour might fall. But to Petty that would have been 
no difficulty, because he held that wealth is purely relative, viz. that if the income 
of each person in a community is halved, everybody is as well off as before. 

In the Political Anatomy of Ireland , Petty seeks to determine war losses in 
Ireland. 

The number of the People being now Anno 1672 about 1,100,000 and Anno 1652 about 
850 M. Because I conceive that 80 M. of them have in 20 years encreased by generation 
70 M. by return of banished and expelled English; as also by the access of new ones, 
80 M. of New Scots, and 20 M. of returned Irish, being all 250 M. 

Now if it could be known what number of people were in Ireland Ann. 1641, then the 
difference between the said number, and 850, adding unto it the increase by generation in 
11 years will shew the destruction of people made by the Wars, viz. by tjie Sword, Plague 
and Famine occasioned thereby. 

I find by comparing superfluous and spare Oxen, Sheep, Butter and Beef that there was 
exported above 1/3 more Ann. 1664 than in 1641, which shews there were 1/3 more of 
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people, viz. 1,466,000. Out of which Sum take what were left Ann. 1652, there will remain 
616,000 destroyed by the Rebellion. 

Whereas the present proportion of the British is as 3 to 11; But before the Wars the 
proportion was less, viz. as 2 to 11 and then it follows that the number of British slain in 
11 years was 112 thousand Souls; of which I guess 2/3 to have perished by War, Plague and 
Famine. So as it follows that 37,000 were massacred in the first year of Tumults: So as 
those who think 154,000 were so destroyed, ought to review the grounds of their Opinions. 

It follows also, that about 604 M. of the Irish perished, and were wasted by the Sword, 
Plague and Famine, Hardship and Banishment, between the 23 of October 1641 and the 
same day 1652. Wherefore those who say, That not 1/8 of them remained at the end of the 
Wars, must also review their opinions j there being by this Computation near 2/3 of them; 
which Opinion I also submit. 

Assuming, which is rash, that the estimates of population in 1672 and 1662 
are correct, the assumption that population varied inversely as exportation of 
cattle seems bold. Might it not be that shipping facilities were better in 1664 
than in 1641 ? Had there been no exportation we could not infer the population 
to be infinite. 

Again Petty has multiplied the estimate for 1672 by 1-333. But he needed 
the population of 1664, which presumably was smaller than that of 1672. If his 
estimate is right, the population was increasing at the rate of about 12-5 
thousands per annum, so he should have multiplied 1,000,000 not 1,100,000 by 
1-333 and has overestimated the 1641 population by 133,330, and therefore the 
number destroyed by the same amount, an overstatement of 20 %. But this is 
not all. If we assign the decrement of population between 1652 and 1641 wholly 
to sword, plague and famine, we must assume that births continued at the peace¬ 
time rate; not a likely assumption. Lastly, it seems unreasonable to assign the 
casualties to the two races in precise proportion to their estimated numerical 
strength in the population of 1641. 

How it follows that 37;000 were massacred in the first year of tumults I do 
not know. 

In a later-work (Treatise of Ireland, pp, 610-11) Petty has another shot at 
this problem. 

He now assumes that Graunt’s deduction from a Hampshire parish register, 
viz. that christenings are to burials in the ratio of 5 to 4, applies to Ireland, and 
that the death rate is 1 in 30, i.e. about what Graunt estimated for London and 
much higher than his estimate for the country. He then proceeds in this way. 
He estimates the population of 1653 to be 900,000 and that of 1687, 1,300,000. 
Then taking 1/30 for the death rate and 1/24 for birth rate, he makes the 
population of 1662, 985,000. He does not comment on the great deorease be¬ 
tween 1652 and 1653; hut there was still war in Ireland in 1652. 

He now says that the population of 1641 was greater than that of 1687, ‘as 
appears by the Exportations, Importations, Tyths, Grist-Mills and the Judg¬ 
ment of Intelligent Persons’. This time he takes the population to be 1,400,000— 
a little less. than, in the earlier estimate—and by the same kind of reasoning 
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again makes the war losses to be about 600,000. One is reminded of Hull’s 
remai'k that Petty confused the accurate with the definite. Also one notes the 
inevitable tendency of a polemical writer—which Petty very decidedly was— 
to maintain his original assertion. Those of us who have never yielded to this 
temptation may cast stones at him. It is not I believe too cynical to say that 
any calculation Petty made would have made the war losses around 600,000. 

Returning to the Political Anatomy of Ireland, we find here a distinct claim 
that the mean age at death (not the mean age of the living) measures longevity. 

As to Longaevity, inquiry must be made into some good old Register of (suppose) 
20 persons, who were all born and buried in the same Parish, and having cast up the time 
which they all lived as one man, the Total divided by 20 is the life of each one with another; 
which oompared with the lilco Observation in several other places, will show the difference 
of Longaevity, due allowance being made for extraordinary contingencies and Epidemical 
Diseases happening respectively within the period of each Observation (p. 172). 

Apart from what we should think the absurdity of basing important con¬ 
clusions upon an average of 20—and Petty only gives 20 as a figure—-the mean 
ages at death of different populations are not comparable unless in each place 
the population is stationary in the sense described above. But, since so acute a 
man as Edwin Chadwick made the same mistake in the nineteenth century as 
Petty in the seventeenth century and it continues to be made in various places 
in the twentieth century, we need not be superior. 

We now come to Petty’s purely statistical work which is concerned with the 
growth of population; before examining this in detail, it will be convenient to 
consider the methods available in the seventeenth century for estimating 
population and notions then current on what may be called the theory of 
population growth. 

It is hard to believe that in the ancient world nobody studied demography 
arithmetically. There is evidence that the Romans enumerated citizens—the 
word census is pure Latin—and it has been suggested that the Romans made 
life tables. Gouraud, cited by Todhunter ( History of the Mathematical Theory of 
Probability, p. 14), refers to a passage cited from Ulpian in the Digest which I 
have discussed elsewhere.* The question was of the value of annuities and the 
conclusion I reached was that Ulpian had no vital statistical basis whatever for 
his figures, that he simply began with the capital value the law gave for any 
usufruct and. then, realizing that people do die eventually, made some sub¬ 
tractions, ending with the absurd (vital-statistically speaking) conclusion that 
after the age of 60 the rate of mortality was independent of age. 

There is not, I think, any reason to believe that the practical Romans had 
anticipated Graunt and Petty. 

That is not to say that nobody studied any demographical problems arith¬ 
metically. Indeed one fellow of the College of Physicians who has had—and will 

* Joum. Roy. Slot. Soc. 103 (1940), 246. 
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continue to have—a hundred readers for every one reader of Graunt and Petty 
made an elaborate .demographical calculation. This was Sir Thomas Browne. 
Sir Thomas devoted the sixth chapter of the sixth book of Pseudodoxia to the 
vulgar opinion that the earth was slenderly peopled before the Flood. 

This vulgar opinion Sir Thomas found to be very wide of the mark. Indeed, 
far from the earth being slenderly peopled, ‘we shall rather admire how the 
earth contained its inhabitants, than doubt its inhabitation: and might con¬ 
ceive the deluge not simply penall, but in some way also necessary, as many 
have conceived of translations, if Adam had not sinned, and the race of man had 
remained upon earth immortal 1 . Indeed Sir Thomas estimates that by the 
seventh century of the world’s history its population amounted to 1,347,368,420. 
He reaches this result in the following way: 

Having thus declared how powerfully the length of lives conduced unto populosity of 
those times, it will yet be easier acknowledged if we descend to particularities, and consider 
how many in seven hundred years might descend from one man; wherein considering the 
length of their dayes, we may conceive the greatest number to bave.been alive together. 
And this that no reasonable spirit may contradict, we will declare with manifest dis¬ 
advantage ; for whereas the duration of the world unto the flood was about 1,600 years, we 
will make our compute in less than half that time. Nor will we begin with the first man, 
hut allow the earth to he provided of women fit for marriage the second or third first 
centuries; and will only take as granted, that they might beget children at sixty, and at 
an hundred years have twenty, allowing for that number forty years. Nor will wo herein 
single out Methuselah, or account from the longest livers, but make choice of the shortest 
of any we find recorded in the Text, excepting Enoch : who after he had lived as many years 
as there be days in the year was translated at 3 68. And thus from one stock of seven hundred 
years, multiplying still by twenty, we shall find the product to be one thousand, three 
hundred forty seven millions, three hundred sixty eight thousand, four hundred and 
twenty, 

1 . 20 . 

2. 400. 

3. 8,000. 

4. 160,000. 

Century. 5. 3,200,000. 

6. 64,000,000. 

7. 1,280,000,000. 

1,347,368,420. 

Simply as a sum, there are difficulties about this result. If our 20 are equal 
numbers of males and females, it is not 20 which should he multiplied by 20 but 
10. If they are all males, then women are left out of the reckoning. But, per¬ 
haps, as the Text does not record the ages of women, Sir Thomas esteemed them 
as ephemerids, sufficiently plentiful however to provide a wife for every husband. 
But then I think he should have said that the 20 to be begotten between 60 and 
100 were all males. Anyhow the sum must be wrong because some of the 
64,000,000 short-lived women of the sixth century should survive into the seventh. 
Indeed Sir Thomas uses his data a trifle capriciously. 

We must surely play a game according to the rules. We are to accept the 
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Text word for word as it stands. But, omitting Adam, whose age at his begetting 
of Cain is not recorded, and Noah, who seems to have reached middle age— 
500 years—before becoming a father, the reproductive habits of eight fathers are 
recorded. Two begat males at the age of 65, one at 70, one at 90, one at 105, one 
at 162, one at 182 and one at 187. When this primary business was over, they 
are all recorded to have begotten an unspecified number of sons and daughters. 
So, if we are to be faithful to the Text, a very much more complicated arith¬ 
metical problem presents itself. A male begets another male at an average age 
of about 100, he then begets males and females at an unspecified rate for say 
another 600 years, required the law of increase. The Text does not authorize Sir 
Thomas to start pre-diluvian breeding at 65 or to stop it at 100. His ‘manifest 
disadvantage ’ is breaking the rules of the game. 

Further, the Text does not entitle him to predicate of the other males the 
lengths of days and procreative exploits of the recorded eight. 

All this, it may be said, is breaking a butterfly upon the wheel. Nobody now 
takes the statistics of the Authorized Version literally. The point is that Sir 
Thomas Browne did, but used them improperly. As Lord Chesterfield said to a 
Garter King at Arms of his day who had not followed the rules of heraldry, 
‘You foolish man, you don’t know your own foolish business’. 

Petty did not tackle pre-diluvian demography, but he did try his hand at an 
estimate of the world’s population after the flood, ‘To justify the Scriptures and 
all other good Histm-ies concerning the Number of the People in Ancient Time ’ 
(p. 465). 

As Petty was not going to allow the population of ancient times to be greater 
than in the seventeenth century, but to make it increase regularly from the time 
of Noah’s Ark, common sense saved him from fantastic figures, but not from, 
physiological difficulties. The rules of the game obliged him to start with eight 
landed from the Ark, so he thought it best to make them increase and multiply 
very fast indeed at first and progressively more slowly. At first he doubled the 
population every ten years, but by the birth of Christ has brought the period up 
to 1000 years. But doubling every ten years (in the first century from the Flood) 
leads one into difficulties. 

We can allow the possibility of the four pairs emerged from the Ark pro¬ 
ducing 8 offspring in ten years and so becoming 16 in year 10, without too great 
difficulty. But ten years later they must number 32 and this is a difficulty. If 
the fecundity of the first settlers remains the same they will contribute 8 more 
children, giving us a population of 24, the balance of 8 must come from the four 
couples of children all of whom must be under 20, and this is a little difficult. 

But at least we may say that there is nothing wholly fantastic in Petty’s 
procedure. Petty does belong to a different arithmetical world from that of 
Browne. Here we may leave purely speculative demography. 

To estimate the people of an area without counting them, we must count 
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something which has a connexion with the number of the people. We may count 
the tax-payers, the houses, the burials, the christenings or the acreage under 
corn—all or any of these items vary with the number of people. 

I wish to keep separate the discussions of Petty’s and Graunt’s statistical 
researches, but in the matter now to be examined Petty used some of Graunt’s 
methods and results, so these must be considered. 

Graunt used three methods of estimation. In the first place, be surmised 
that the number of child-bearing women in a community might be about double 
the number of annual births ‘forasmuch as such women, one with another, have 
scarce more than one child in two years’. Then he surmised that families were 
twice as numerous as women of child-bearing age. His reasoning was that 
women between 16 and 76 might be twice as numerous as women between 16 
and 40 or 20 and 44 (i.e. of child-bearing age), and he thought of a family as 
centred round a married couple. Finally, he thought that the average family 
would consist of eight persons, the husband and wife, three children and three 
servants or lodgers. So, starting with 12,000 christenings, which he thought a 
fair measure of annual births, he reaches 24,000 women of fertile age, then 
48,000 families and lastly 384,000 persons. 

It is quite certain that Graunt’s estimate of an annual fertility rate of 500 
per 1000 was an enormous overstatement. In London in 1851, the ratio of 
legitimate births to married women aged 15-45 was 251'8 per 1000. There is no 
reason to believe that nuptial fertility changed appreciably between 1660 and 
1860. But an error of this kind would lead him to an understatement of families, 
Now, however, another error saves him. We cannot be so positive that eight to 
the family is a great overstatement as we can that the marital fertility was not 
500 per 1000, but it is much higher than any nineteenth-century finding. Using 
this multiplier saves Graunt in this sense, that his quaint rule gives almost 
precisely the right answer for the population of London nearly 200 years after 
his time. 

The legitimate births registered in London in 1851 were 75,097. This, ac¬ 
cording to Graunt’s rule, is to be multiplied by 32. The result is 2,403,104. The 
enumerated population was 2,363,236; the conjecture is only 1-7 % out. Sic me 
senmit Apollo. 

Graunt’s next method was experimental and very briefly described. He 
counted the numbers of families in certain parishes within the walls and found 
that ‘3 out of 11 Families per annum have died’. He then multiplies the burials 
for the year (13,000) by 11/3, and proceeds as before. 

Finally, he took Newcourt’s map of London and 

guessed that in 100 Yards square there might be about 54 Families, supposing every House 
to be 20 Foot in the front: for on two sides of the said square there will be 100 Yards of 
Housing in each, and in the two other sides 80 each; in all 360 Yards: that is 54 Families 
in each square, of which there are 220 within the Walls, making in all 11880 Families within 
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the Walls, But forasmuch as there die within the Walls about 3200 per Annum, and in the 
whole 13,000, it follows that the Housing within the Walls is \ part of the whole, and conse¬ 
quently, that there are 47,520 Families in and about London, which agrees well enough 
with all my former computations (p. 386). 

These conjectures led Graunt to think that the rate of mortality in London 
was about 1 in 32. In his first essay on the growth of London (pp. 458-75) Petty 
bases himself upon that estimate, and in the series of papers (pp. 505-44) this 
remains the fundamental method, but Petty allows himself to modify the 
multiplier, not altogether without suspicion of bias. At a quite early stage he 
had satisfied himself that London was the largest city in the world and much 
larger than Paris. This is the kind of argument. For the three years 1682-84, 
the average of burials in London was 22,337 and for Paris 19,887. So if the rates 
of mortality were the same, London was larger than Paris.* If the rate of 
mortality in Paris were higher than in London then the population of London 
must be larger still. According to Petty (a) a larger proportion of the Paris 
population died in hospital, ( b ) the mortality in hospital was heavier in Paris 
than in London. So it follows that the general death rate of Paris was higher. 

That at London the Hospitals are better and more desirable than those of Paris, for 
that in the best at Paris there die 2 out of 16, whereas at London there die out of the worst 
scarce 2 of 16, and yet but a fiftieth part of the whole die out of the Hospitals at London, 
and 2/6 or 20 times that proportion die out of the Paris Hospitals which are of the same 
kind; that is to say, the number of those at London who chuse to lie sick in Hospitals rather 
than in their own Houses, are to the like People of Paris as one to twenty; which shows the 
greater Poverty or want of Means in the People of Paris than those of London. We infer 
from the premisses, viz. the dying scarce 2 of 16 out of the London Hospitals, and about 
2 of 16 in the best of Paris (to say nothing of Vhostel Disu) that either the Physicians and 
Chirurgeons of London are better than those of Paris, or that the Air of London is more 
wholesome (p. 608). 

These, however, are only logical deductions if the user of the hospitals in 
London and Paris is identical. If, as implied in the first part of the quotation, 
we think of hospitals in the sense which our elder contemporaries think of the 
old-fashioned poor law infirmaries, viz. as refuges for the sick poor, it would 
mean that in Paris more of the aged indigent died in institutions than in London 
and heavy mortality might well have nothing to do with the skill or lack of skill 
of the medical staff. If we think of hospitals in the modern sense, then heavy 
mortality might be a mere reflection of the resort to these hospitals of persons 
suffering from illnesses which needed special treatment. In any case, Petty can 
hardly have it both ways. In another essay (pp. 510-11) he contrasts the higher 
ratio of deaths to admissions at l’hostel Dieu of Paris with that of la Charity, 
argues that the excess in F hostel Dieu is unnecessary and proceeds to calculate 

* It should, be remembered that the London of Petty’s calculations is the whole area within 
the Bills. The calculations of Graunt described above did not include Westminster or the six out- 
parishes of Surrey and Middlesex which were within the Bills: Islington, Lambeth, Stepney, 
Newington, Hackney, Redriff. 
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what the French nation would gain by saving this excess. But he has not in¬ 
quired whether the patients of the two institutions were in pari materia. 

Here is an historical problem which might be solved by those familiar with 
the literature of the period. Its discussion would not be relevant here. It is, 
however, only just to Petty to say that, unless conditions deteriorated seriously 
in the following century, his strictures on l’hostel Dieu were justified. In 
Franklin’s work (La Vie Privee d'autrefois. H Hygiene (Paris 1890), pp. 177 et 
seq.) an appalling account of this hospital from the pen of the eminent surgeon 
Tenon, printed in 1788, is quoted. Tenon’s description of the routine of this 
great hospital compares, unfavourably, with the story of the wounded in the 
Mesopotamian campaign which horrified England in the war of 1914-18. He 
remarks, inter alia, ‘on ne guerissoit point de trepanes autrefois 4 l’Hotel-Dieu, 
eomme on n’en guerit pas encore aujourd’hui ’, and cites a court surgeon of the 
time of Louis XIV, i.e. a contemporary of Petty, to that effect. His account of 
the treatment of lying-in women is grotesquely horrible. 

In another essay (pp. 533-6) Petty discusses methods of estimation more 
carefully than in his other papers. 

He proposes to show that the population of London (within the Bills) in or 
about 1685 was approximately 696,000. 

There are, he says, three methods: (1) From houses and families. (2) From 
an estimated death rate. (3) From the ratio of those who die of the plague to 
those who escape. 

This last we may deal with at once. Petty asserts that Graunt had proved 
that one-fifth of the people died of the plague. But in 1665, 98,000 died of the 
plague; therefore the population was 490,000, and allowing an increase of one- 
third between 1665 and 1686 we reach 653,000, 

Graunt could not have proved that one-fifth of the population died of the 
plague unless he knew what the population was, and he never claimed to have 
done so. 

The other methods (which Graunt used) are rational. 

To estimate houses, Petty used three methods. He says that in the Fire of 
1666, 13,200 houses were burned and that deaths from these houses were one- 
fifth of total deaths, so he reckons the houses to have been 66,000. Then as 
burials in 1686 were to burials in 1666 as 4 to 3, he makes the houses of 1686, 
88,000. He does not, however, say upon what basis the estimate of one-fifth of 
the deathB in 1666 stands. 

Next, he gives an estimate of the houses in 1682 given him by those employed 
upon a map said to have been made in that year. This map has not been 
identified. 

Lastly, he uses the return of hearths. In Dublin in 1685 the hearths were 
29,325 and the houses 6400. In London the hearths were 388,000; so the houses 
on the Dublin ratio should be 87,000. In Bristol he says there were 5307 houses 
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and 16,752 hearths, which give 123,000. houses for London; the mean of the 
calculations is 105,000. The Hearth Office itself, he says, certified the number 
to be 105,315. He must now have a multiplier. He accepts Graunt’s multiplier 
of 8 as valid for tradesmen’s families, but allows for smaller families among the 
poor and larger among the rich, finally choosing 6. He then allows for double 
families in houses by adding 10,531 to his 105,315, and multiplying the sum by 
6 has 695,076 for the population. 

' Petty’s second way was from an estimated death rate. 

Petty multiplies the average of the burials in 1684 and 1685 (23,212) by 30, 
which makes the population 696,360. 

He now essays to prove that the death rate in London was 1 in 30. He uses 
four arguments, of which only one is strictly to the point, viz. Graunt’s direct 
observation that three deaths occur annually in eleven families—which however 
involves the assumption of eight persons to the families observed. Two others 
are relevant, viz. observations, apparently direct, that in ‘healthful places’ the 
mortality is 1 in 50 and in nine country parishes 1 hi 37. The fourth partly rests 
upon a statement which Graunt did not make, viz. that one of 20 children under 
10 dies annually. This fictitious value Petty averages with the statement of a 
M. Auzout to the effect that the rate of mortality of adults in Rome is 1 in 40. 
It will be dear that Petty has proved nothing at all. What he has done is to make 
it unlikely that the rate of mortality was less than 1 in 30. That, perhaps, was 
enough. One has a certain sympathy with his round statement: ‘Till I see 
another round number, grounded upon many observations, nearer than 30, 
I hope to have done pretty well in multiplying our Burials by 30 to find the 
number of the People.’ 

With this I may conclude the analysis of Petty’s statistical work. It will, 
I think, soon be clear enough that it is not of the calibre of Graunt’s. Yet I 
caimot take leave of it without something of an ave. Careless, happy-go-lucky, 
tendentious; yes, all that. But anybody who has felt the exhilaration, to which 
Francis Galton owned, in the doing of sums concerning biological problems, feels 
his heart warmed by the arithmetical knight errant who had so many statistical 
adventures. 


(To be continued) 
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1. Introduction 

The theory of confidence intervals was started by the present author about 1930. 
At that time it was taught in lectures, given both at the Univei’sity and at the 
Central College of Agriculture, Warsaw, Poland. The theory found immediate 
practical applications, and before any theoretical paper was published, a booklet 
(Pytkowslti, 1932) appeared giving numerical confidence intervals for means and 
for regression coefficients. The term ‘confidence interval’ is a translation of the 
original Polish ‘przedzial ufnosci’. The author’s theoretical results appeared two 
years later (Neyman, 1934). At almost the same time the first tables and graphs 
of confidence intervals were published (Clopper & Pearson, 1934) in a paper which 
gave a remarkably clear explanation of the difference between the new approach 
to the problem of estimation and the old one, by means of Bayes’s theorem. 

The first publication on fiducial argument (Fisher, 1930) anticipated the booklet 
of Pytkowski by two years. The present author-overlooked this article for some 
time. However, when preparing his paper of 1934, he was already acquainted with 
it and also with the next paper (Fisher, 1933) on a similar subject. Although 
Fisher’s method of approach was entirely different from the author’s, the 
numerical identity of Fisher’s fiducial limits with the confidence limits in the 
author’s theory, and also some of Fisher’s early comments, suggested to the author 
that the two theories are essentially the same. Accordingly, and owing to the 
difference in dates of publications, the author considered his own work as an 
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extension of the previous results of Fisher. This was clearly stated in the author’s 
paper of 1934. 

Apart from the above points of agreement the author had found certain 
passages and conceptions in the publications of Fisher which were difficult for 
him to understand and to reconcile with what was essential in the theory of con¬ 
fidence intervals. They included ‘fiducial probability’ and ‘fiducial distribution 
of a parameter ’. However, the author was inclined to think that these were, more 
or less, lapsus linguae, difficult to avoid in the early stages of a new theory. This 
attitude was clearly expressed in the paper of 1934. That paper was read before 
a meeting of the Royal Statistical Society and was followed by a public discussion 
recorded in the Society’s Journal. Fisher took part in the discussion, and it was 
a great surprise to the author to find that, far from recognizing them as mis¬ 
understandings, he considered fiducial probability and fiducial distributions as 
absolutely essential parts of his theory. As a result, the author began to doubt 
whether the two theories were, in fact, equivalent. These doubts were only 
increased by Fisher’s insistence that the calculation of fiducial distributions and 
fiducial limits must be limited to cases where sufficient statistics exist (Fisher, 
1936), and by his warnings against inconsistencies in the theory of confidence 
intervals. 

When questioned on the subject, the author could not conceal his doubts and 
they were published (Neyman, 1938a). Subsequent publications by other authors 
appear to be divided. Some, e.g. the very important papers by Wald (1939) and 
by Wald & Wolfowitz (1939), deal with the theory of confidence intervals, entirely 
ignoring fiducial theory. Others (Starkey, 1938; Sukhatme, 1938; Yates, 1939), 
at the other extreme, work on the ground of fiducial argument and ignore the 
confidence intervals. There is also an intermediate group of authors with an almost 
continuous spectrum of opinions. Pitman (1939), in a very interesting paper on 
estimation of location and scale parameters, states that the two theories ‘ are 
essentially the same and that their two points of view are both necessary for a full 
comprehension of the theory of estimation’. And a few pages further: ‘I at first 
called it the fiducial probability function, but finally decided to shorten the name 
by dropping the word “probability 

Next we find the statement (Bartlett, 1939) that ‘ by a distribution of fiducial 
type we shall mean a distribution providing at least confidence intervals in the 
sense of Neyman’. This statement is used in an argument (Bartlett, 1936, 1939) 
that, as a distribution deduced by Fisher (1936) does not seem to provide con¬ 
fidence limits, there must be some error in the deduction. A similar point of view, 
but with a stronger leaning towards confidence intervals, is expressed by Welch 
(1939). In this paper various general claims of Fisher are analysed, essentially 
from the point of view of confidence intervals, and tested on appropriate examples. 
Among other things it is found that the fears of inconsistencies in the theory of 
confidence intervals are unfounded. 



130 Fiducial argument and the theory of confidence intervals 

A quite different school of thought is represented by Jeffreys (1940), according 
to which the fiducial approach to the problem of estimation is completely equi¬ 
valent with that by inverse probability. 

Fisher (1937, 1939a, 19396) and Yates (1939) emphatically deny that there 
is an error in Fisher’s paper of 1936. On the contrary, it is said that the results 
then published were obscured by the controversy arising from Bartlett’s con¬ 
fusion about the nature of fiducial argument. Also, especially in earlier papers 
(1930, 1933, 1936), Fisher is equally emphatic on the distinction between the 
fiducial and the inverse probability approaches to the problem of estimation. 

The above survey shows that there is an interesting divergence of opinions as 
to what is essential in the fiducial theory in general and as to whether it is in any 
way connected with the theory of confidence intervals. The perusal of .all the 
literature quoted does not allow the present author to form any precise opinion 
as to the first of these questions. On the other hand, there now seems to be sufficient 
ground for answering the second, concerning the relationship between the two 
theories. The purpose of the present paper' is to show that there is none. The 
relevant points concerning this question, which were possible to establish on the 
ground of earlier literature, are explained in excellent papers by Pearson (1939) 
and Welch (1939), with the final conclusion that, in spite of various differences, 
the two theories are closely related. However, fresh evidence provided by papers 
of Fisher (1939 a, 19396) and Yates (1939) shows that no such relation exists and 
that the authors suspecting it were misled by the incompleteness of earlier writings 
concerning fiducial argument. 

As a result of the present paper it may be found expedient, for the sake of 
clarity, to avoid confusion of terminologies appropriate to the two theories. 
Instead of writing, as some authors do, on ‘ fiducial or confidence * limits, it may he 
preferable to discuss ‘fiducial limits’ or ‘confidence limits’, as the case may be, 
separately. 


2. Basic) ideas in the theory of confidence intervals 

The key to understanding the theory of confidence intervals is in being clear 
about what might be called the classical point of view in the theory of probability. 
This theory was originally built up to answer questions about how frequently a 
given combination of throws will occur in a long series of games of dice. Thus; the 
probability of a certain combination found to be, say, 1/5, implies that this com¬ 
bination would appear in about 20 % of a long series of actual games. This agree¬ 
ment may, but need not, be observed. In the latter case, we would say that the 
assumptions underlying the deduction were not realized by the actual experi¬ 
ments. The dice used were perhaps ‘ biased and so forth. The point is that, when¬ 
ever it is said that a given set of probabilities does refer to some phenomena, then 
it is understood that the relative frequencies of various aspects of the phenomena, 
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in a long series of trials, are approximately equal to corresponding probabilities. 
This is just what the author calls the classical point of view in the theory of 
probability. It is excellently explained by v. Mises (1939), but is more general than 
the definition of probability adopted by that author.* 

Apart from the classical point of view on probability, there is another. It 
considers the probabilities as measures of rational belief in the truth of a given 
proposition. Here the agreement between the probability and some relative 
frequency is not essential. 

The theory of confidence intervals was built up to give a solution of problems 
of estimation which would have a clear frequency interpretation, characteristic 
of the classical point of view. Consider a set E of n observable random variables, 
aq,..., x n , and assume as given that the function p(E | 6 V d 2 , ...,d s ) represents its 
elementary probability law. Here 0i,-A represent certain parameters whose 
values are unknown. 

The above should be interpreted as follows. There are some actual trials T 
which are able to determine the values of the afs. There are also some numbers 
i9q, d 2 ,...,■£>■„ unknown to us, such that, whatever be a region w in the space of 
the afs, the integral of p(E | ..., d- 8 ) taken over this region is approximately 

equal to the relative frequency with which the point E, as determined by the 
trials T , falls within that region w. The problem of estimating one of the para¬ 
meters, e.g. 6 X , consists in using just one system of the afs as determined by the 
trials T to calculate i9q approximately. Alternatively, it may consist in calculating 
an interval (a,a + d) which ‘presumably’ covers fiq. 

The original approach to this problem is based on Bayes’s theorem. Denote 
by p(6 x , (?„,..., 0 3 ) the elementary probability law of the 0’s. Then 

p(9 1 ,...,6 s )p{E'\G x ,...,0 8 ) _ ^ 

jp(0 1 ,...,O e )p(E' |0 1# ...,9 s )dd Xi ...,d6 s 

will be the relative probability law, or the probability law a posteriori of all the 
6’ s given the observed system E' of the values of the afs. It can be used to calculate 
the most probable value of 6 X . Alternatively, given a number d> 0, the law can be 
used to- find the interval (a, a + d) such that the a posteriori probability 

P{a+d>6 x >a \ E'} 

is greatest. 

Our attitude towards this kind of solution, dictated by the classical point of 
view on probability, depends on circumstances and may be twofold. 

The circumstances of the problem may imply not only that the afs but also that 
the 0’s are random variables and that the function p{9 x , ...,d 3 ) could be used to 

* It will be noticed that the classical point of view or probability does not imply any particular 
definition of that concept. It is not suggested that the one adopted by v. Mises is the only one 
that could be consistently used. 


p(6 x ,9 2) ...,0 s \E') = 


I- 
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calculate the relative frequencies of various combinations of values of the 8's. 
Such situations are rare, but they do occasionally occur, especially in problems of 
genetics and of mass production. If the function p(8 i , ...,d a ) is implied by the 
problem considered, then the probability P{a + d>O l >a \ E’} has a clear fre¬ 
quency interpretation, as follows. Imagine a long sequence, $, of cases where the 
8’s vary according to the above law and the afs are determined by the particular 
trials considered. Pick from this sequence S a subsequence S(E') of such trials 
in which the experiments determined the same system of values of the %’&, namely, 
the system W. Naturally, the value of 0 1 in cases belonging to 8(E") would vary. 
But, if the functions p (E \ 8 V ..., 0 S ) and p(0 v .... 0 S ) do have the presumed relation 
to the trials considered, it will be found that among all the intervals of length d, 
the interval (a, a + d) will contain the value of 8 X more frequently than any other, 
and that this frequency will be approximately equal to P{a + d > 9 X > a | E'}. It 
follows that, if the function p(0 x ,..., 6 S ) is implied by the circumstances of the 
problem of estimation, the use of the formula (I) is perfectly legitimate from the 
point of view of the classical theory of probability. 

The situation is quite different when the circumstances of the problem do not 
imply the a 'priori probability law. This is most frequently the case. Moreover, 
usually there are serious difficulties in considering the 9 ’s as random variables. 
Jeffreys (1939) advises the use of formula (1) also in such cases, with a function 
p(d v 8 a ) invented for the purpose. He claims that the conclusions drawn in 
this way are valid, provided that the function used is j ust the one that he suggests. 
The present author would not question this statement on condition that the word 
‘valid’, or any other such description, is not given any significance beyond that 
described above. In other words, there seems to be no reason why we should not 
agree to call the above conclusions ‘valid in the sense of Jeffreys’. On the other 
hand, it seems essential to he clear that any probability calculated from (1), with 
any function p(8 v ...,6 s ) not implied by the actual problem, need not and, 
generally, will not have any relation to relative frequencies. It will not be the 
probability in the classical sense of the word and, therefore, persons who would 
like to deal only with classical probabilities, having their counterparts in the 
really observable frequencies, are forced to look for a solution of the problem of 
estimation other than by means of the theorem of Bayes. 

This solution (Neyman, 1937, 19386) may be obtained as follows. Consider 
the case where the circumstances imply that the a’s, forming a system E, are 
random variables with the probability la w p(E j 6 X ,0 2 , ...,$„), where 6 X , d 2 , ...,8 S 
are unknown. Denote by 8(E) and 8(E) two functions of the x’s. Obviously, if 
E is random then these functions will also be random variables. 

Definitiojt 1. 7/ the functions Q(E) and 6(E) possess the property that , 
whatever be the possible value of 6 X and whatever be the values of the unknown 
parameters 8%, 6 3l 6 Sf the probability 

P{&(E)^ 1 ^d(E)\-» 1 ,6 i , ...,d 8 } S a, 


(2) 
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then we will say that the functions 8(E) and 0(E) are the lower and the upper 
confidence limits of 6 V corresponding to the confidence coefficient a. The interval 
(6(E), 6(E)) is catted the confidence interval for 8 X . 

In spite of the complete simplicity of the above definition, certain persons have 
difficulties in following it. These difficulties seem to be due to what Karl Pearson 
(1938) used to call routine of thought. In the present case the routine was estab¬ 
lished by a century and a half of continuous work with Bayes’s theorem. It may 
be useful, therefore, to give a few illustrations. 

Assume that s = 2, that f) l may have only the five values 1, 2, 3, 4, and 5, and 
that, at the same time, 8 a may vary continuously between zero and 1. To satisfy 
Definition 1, the only requirement on the functions 8(E) and 6(E) is that 

P{i(E)4-0^8(E)\i>,d 2 }sa (3) 

for all values of ft = 1, 2, 3, 4, and 5, and for 0 2 varying between (0, 1). The 
probabilities (2) and (3) are, therefore, not the probabilities of 6 1 falling within any 
limits. On the contrary, they are the probabilities of the functions 6(E) and 6(E) 
falling on both sides of a specified number These probabilities are to be calcu¬ 
lated from the given function^)® | d v 6 2 ) with the value of 6 X set equal to the same 
number d. The result must be totally independent of the values of d v ,.., 6 S and 
must equal a. 

It is known (Neyman, 19356; Feller, 1938) that in certain cases no such func¬ 
tions Q(E) and 6(E) exist. Then there are ways of modifying the formulation of 
the problem, for example, requiring that the probability on the left of ( 2 ) he at 
least equal to a, and so forth. In other cases, there will be an infinity of pairs of 
confidence limits all corresponding to the same a. In this case, the practical 
statistician is at liberty to choose among them. 

Let us now consider the frequency interpretation of the solution of the 
problem of estimation by means of confidence intervals. Suppose that some two 
functions 6(E) < 6(E) possess property ( 2 ) with some large value of a, say a = 0 - 99. 
Their use in practice would consist of (i) observing the value E' of the x’s, (ii) 
calculating the corresponding values of the confidence limits 6(E') and 6(E‘), and 
(iii) stating that the true value dq of 6 X lies between 8(E') and O(E'). The justifica¬ 
tion is simple and perfectly in line with the classical point of view of probability: 
in many applications, the relative frequency of cases in which the statement 
0(E) 5 ; < 6(E) is correct will be approximately equal to a = 0-99, whether or not 

the parameters for estimation are the same in all cases. 

The word ‘ stating ’ above is put in italics to emphasize that it is not suggested 
that we can ! conclude ’ that 6(E') ^ < 6 (E'), nor that we should ‘believe’ that 
dq is actually between 9(E) and 6(E). In the author’s opinion, the word 1 conclude ’ 
has been wrongly used in that part of statistical literature dealing with what has 
been termed ‘ inductive reasoning ’. Moreover, the expression ‘ inductive reasoning ’ 
itself seems to involve a contradictory adjective. The word ‘ reasoning 1 generally 
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seems to denote the mental process leading to knowledge. As such, it can only he 
deductive. Therefore, the description ‘inductive’ seems to exclude both the 
‘ reasoning ’ and also its final step, the ‘ conclusion If we wish to use the word 
‘ inductive ’ to describe the results of statistical inquiries, then we should apply it 
to-‘behaviour’ and not to ‘reasoning’. The factthata given pair of functions 0(E) 
and d(E) satisfies the identity (2) may be ‘deduced’ from the properties of the 
function p(E | 0 V Earlier trials may show characteristics in the empirical 

distribution of then’s which seem in agreement with the function p(E | 0 1 . 0 a ). 

On these grounds, after observing the values of the x’a in a case where the O'a are 
unknown and calculating d(E') and d(E'), we may decide to behave as if we actually 
knew that the true valued of d x were between 0(15') and O(E'). This is done as a 
result of our decision and has nothing to do with ‘ reasoning ’ or ‘ conclusion ’. The 
reasoning ended when the functions 8(E) and 9(1?) were calculated. Theabove pro¬ 
cess is also devoid of any * belief’ concerning the value 8 X of 0 V Occasionally we dp 
pot behave in accordance with our beliefs. Such, for example, is the case when we 
takeout an accident insurance polioy while preparing for a vacation trip. In doing 
so, we surely act against our firm belief that there will be no accident; otherwise, 
we would probably stay at home. This is an example of inductive behaviour. 

Obviously, if there are many different pairs of functions, 6(E) and 0(E), all 
corresponding to the same a, our choice of the one to use must he based on the 
detailed study of their properties. For example, if it appears that the difference 
between one pair, d 1 (E) — 6 1 (E), is always (or most frequently) smaller than that 
between some other pair, then we would probably prefer to use the first. The 
problem of determining the confidence limits and of studying their properties 
forms the subject of the theory of confidence intervals. 

3. Necessary and sufficient conditions for a pair oe 

EDNOTIONS TO BE CONFIDENCE LMvIITS 

Let a(E) < b(E) be any two single-valued functions of the x’a determined for 
all possible systems of their values. Denote by W the space of the £c’s and by 
one of the possible values of Q v Finally, let A($ x ) denote the region in the space 
W composed of all points E which satisfy the double inequality, 

a(E) <'& 1 <6(jB). (4) 

It was proved (Neyman, 1937) that for the two functions, a(E) and b(E), to be the 
lower and upper confidence limitsfortheparameter 6 1; it is necessary and sufficient 
that, whatever be the possible value fiq of 9 V the probability 

P{EeA(-9 1 )\8 1 = & 1 }=a. (5) 

The identity refers to the arbitrary variation of 6 S ,..., 0 g . 

This condition will be used below to show that a certain pair of functions does 
not represent the confidence limits. For this purpose, the following steps will be 
taken; We Bhall select a convenient value ■d 1 of the estimated parameter 6 t and 
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determine the region as in (4). Next, we shall substitute this same value & 1 
instead of the parameter 0 X in the elementary probability law of the variables 
considered, getting p(E\ dj, ...,8 S ). This last function will be integrated over 
A(d' 1 ) to find the probability P{EeA | 0 1 = as in the left-hand side of (5). But 
this integral will be dependent on the values of the other parameters involved, 
showing that the identity (5) is not satisfied. The conclusion will be that the 
particular functions considered are not confidence limits. 

4. Differences between the theoby op confidence intervals 

AND THE THEORY OF FIDUCIAL ARGUMENT 

In this section we will consider examples treated both from the point of view 
of confidence intervals and of fiducial argument. These will be selected to illustrate 
both the conceptual and the numerical differences between the two theories. 

(i) Evidence of conceptual differences between the two themies. The first results 
obtained concerning confidence intervals (Neyman, 1934) refer to the case where 
all the n observable variables x i are mutually independent, normally distributed, 
have the same though unknown standard error cr, and expectations S(xf) which 
are linearly connected with some s<n unknown parameters p v p 2> .. ,,p 8 , so that 

= OtiPi+ a i2pz>+ ••• + ( 6 ) 

Here'the a’s are supposed to be known and to form a non-singular matrix. Denote 
by 6 any linear combination of the same p’s, that is 

6 = b x p x + b 2 p 2 4-... + b e p ? , (7) 

with known 6’s not all equal to zero. In these circumstances, a confidence interval 
for 6 is given by F-St a ^6^F + 8t al (8) 

where F denotes the best unbiased estimate of 6 (David & Neyman, 1938), S the 
estimate of the standard error of F, and t a the value of the ‘Student’-Fishery 
corresponding to the number of degrees of freedom n—s and to P = 1 — a. The 
application of more recent theory (Neyman, 19356) shows that the confidence 
intervals (8) have distinct advantages over any others by satisfying the definition 
(Neyman, 1937) of the ‘ short unbiased system of type B t Without entering into 
these details, we shall consider the particular case where s — 1 , a a = 1 and 
b x = 1. This will be the case if all the x'& come from the same unknown normal 
population and it is desired to estimate its mean, 6 = £{x t ). In that case F — x 

* »(w—1) ■ W 

Asmentioned, the general-confidence interval (8) was discussedin lectures about 
1930, and in 1932 a publication appeared using the concept and the formula (8). 

As far as is known, the first full discussion of the corresponding result in the 
fiducial theory was given by Fisher a few years later (Fisher, 1935, 1936), and 
here is the relevant passage from the second paper. 
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If a sample of n observations, * x , x n , has been drawn from a normal population 
having a mean value ft, and if from the sample we calculate the two statistics x = 2x i /n and 
s 2 = £(xi-w) z /{n- 1), ‘Student 1 has shown (1925)* that the quantity t, defined by the 
equation 


t = 


(:c—/i)dn 


( 10 ) 


is distributed in different samples in a distribution dependent only from the size of the 
sample, n. It is possible, therefore, to calculate, for each value of n, what value of t will bo 
exceeded with any assigned frequency, P, such as 1 % or 5 %. These values of t are, in fact, 
available in existing tables (Fisher, 1925-34). 

It must now be noticed that t is a continuous function of the unknown parameter, the 
mean, together with observable values, x, a and n, only. Consequently the inequality 
2>q is equivalent to the inequality 

/iKx—stJdn, ( 11 ) 

so that this last inequality must be satisfied with the same probability as the first. This 
probability is known for all values of q, and decreases continuously as q is increased. Since, 
therefore, the right-hand side of the inequality takes, by varying q, all real values, we may 
state the probability that fi is less than any assigned value, or the probability that it lies 
between any assigned values, or, in short, its probability distribution, in the light of the 
sample observed. 

It is of some importance to distinguish such probability statements about the value of 
ji, from those that would be derived by tho method of inverse probability, from any 
postulated knowledge of the distribution of ji in the different populations which might have 
been sampled.... To distinguish it from any of the inverse probability distributions de¬ 
rivable from the same data it has been termed the fiducial probability distribution, and the 
probability statements which it embraces are termed statements of fiducial probability. 

In the next section we shall analyse the above passage in detail and show 
exactly where and how it conflicts with the classical theory of probability and 
thus with the theory of confidence intervals. Here we will mention only that it is 
ambiguous. Just this kind of ambiguity, which is also found in the earlier papers 
(Fisher, 1930, 1933), is probably responsible for a number of authors, including 
the present one, thinking that the fiducial theory and the theory of confidence 
intervals are lipked. 

In a few years it was found necessary to reinterpret formula (11). This was 
done by Fisher himself (1939 b) and, somewhat more clearly but on the same lines, 
by Yates (1939). It will be seen from the following quotation from Yates’s paper 
that the above passage by Fisher certainly does not contain everything which is 
now considered essential in the fiducial theory and that the presumption of any 
link between the latter and the theory of confidence intervals is unfounded. 
Yates’s more relevant sentences are italicized by the present author. 

While explaining the meaning of the fiducial distribution of the mean /« of a 
normal population, Yates mentions that the fiducial distribution of cr 2 is given by 

1 ( 12 ) 


o“ 


Z(*i- 

where x 1 has its usual distribution with n- 


■x)*’ 

1 degrees of freedom. 


* Aotually, of course, this result appeared earlier (‘Student’, 1908). 
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It can then be shewn that, for a value of p equal to p e and a given s, the value of x in 
subsequent samples would be as small as that observed in a fraction e of the samples, 
provided that the actual distribution of or 2 is the same as the fiducial distribution given above. 

In this form, however, the statement is open to objection on the ground.that in subse¬ 
quent samples cr may in fact be distributed in any manner, and that s will certainly vary 
from sample to sample. To avoid this objection we must frankly recognize that we have here 
introduced a new concept into our methods of inductive inference, which cannot be deduced by 
the rules of logic from already accepted methods. ...That is... the form of fiducial statement 
which is implicit in the t test as ordinarily used by practical experimenters... .It must be 
recognized as essentially different from the statement that t will exceed t e in a fraction e 
of all experiments. The latter is true for any given fixed cr or any set of er’s. The former 
(i.e. the fiducial statement, J.N.) is true for a given s when a is taken to be flduciatty distri - 

buted in the appropriate distribution _The logical difference between the two approaches 

(fiducial and inverse probability, J.N.) should, however, be recognized. The approach by 
inverse probability enables fiducial statements about p to be derived from the classical 
theory of probability, without the introduction of any new principle, but only at the cost of 
postulating a particular a priori distribution of cr. In the fiducial approach such a priori 
postulation is regarded as inadmissible, but in order to discard it a new principle, that of 
utilizing the fiducial distribution of cr, must be introduced.... Once the principle is accepted it 
is possible, given sc and s, to make formal and exact statements of the fiducial type about p 
which are independent of all prior knowledge of cr. If the principle is not accepted, then it 
appears that we must either assume an a priori distribution of cr, or deny that there is any 
possibility of making fiducial statements about p. 

The present author is unable to understand the exact meaning of what is 
called ‘fiducial statements about p\ However, his conclusion is that their con¬ 
ceptual nature must be quite different from that dealt with in the theory of 
confidence intervals. This conclusion is based on the fact that all the difficulties 
described by Yates as inherent in the fiducial theory are non-exisf ent in the theory 
of confidence intervals. Applications of the latter require no new principle 1 which 
cannot be deduced by the rules of logic no assumption that this or that unknown 
parameter follows any specified distribution, and have no connexion with Bayes’s 
theorem. To make the situation absolutely clear, imagine a sequence of normal 
populations n x ,Tt^, ..., with their means d lt 6 z , ...,d m ,... and their stan¬ 

dard deviations oq, cr 2j ...,cr m ,.... Imagine that out of each population rr m we 
have a random sample E m of n individuals, with its mean x m and an estimate of 
the corresponding variance 8 2 m as in (9). The theory of confidence intervals 
guarantees that the relative frequency with which x m —t a 8^ will fall short of the 
corresponding 6 m and, at the same time x m +t x 8 m will exceed this same number 
6 m , will be, within an error of sampling, equal to a. An incredulous reader may 
easily check this by a sampling experiment. In this he will be at liberty to keep 
6 m and/or <r m constant, or to vary them at his, pleasure, without any restriction. 
Of course, the distributions of the populations sampled should be more or less 
normal and the sampling should be random. It follows from the above passages 
of A ates that if the requirements above are satisfied and no new principles 
accepted, then we have to deny that there is any possibility of making fiducial 
statements about 0 m . If so, then the nature of the latter is different from thpse 
involved in the application of the theory of confidence intervals* 

Biometrika xxxri 10 
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The comparison of the above comments by Yates with those of Fisher gives 
a curious impression, Where Yates sees so many difficulties and restrictions, 
Fisher mentions none. Yet this very publication of Yates is fully endorsed by 
Fisher (19396). 

(ii) Numerical differences between the two theories. Besides establishing the 
existence of conceptual differences, it is essential to show that the two theories 
may give different numerical results. We may conclude from the discussion above 
that the application of confidence intervals requires fewer restrictions. But there 
is a logical possibility that, when both theories are applicable, they give the same 
numerical result. The following example shows that this is not the case and that 
fiducial limits need not satisfy the definition of confidence limits. 

The example that we are going to discuss refers to the problem of estimating 
the difference, say 6, between the means of two populations of which it is known 
only that both are normal. Denote by 

* 1.11 * 1,21 •••! * 1,9111 

*2,11 *2,21 • • •! *2,91'i-l 

two random samples to be drawn from these populations and let n^n'. The 
confidence limits for S have been very elegantly obtained by Bartlett. He did not 
publish his results himself but they are briefly mentioned in a paper by Welch 
(1938). The tendency towards a greater generality of presentation resulted in 
certain complications. The following is a less general but simplified statement of 
the results.* Assume that the *’s in (13) are numbered in the order in which they 
will be given by observation. Otherwise, randomize the second series. Next 
calculate n differences 

«i = *i,i-*2,i (i= 1,2,...,«). (14) 

If £(x Xi ) = d + S and ^(aJ 2ji ) = 0. then <?(%) = S. If the s.n.’s of the two 
populations sampled are cr and cr', then the s.e. of u t will be (cr 2 + cr' 2 )k The con¬ 
secutive u’b will be normal and independent and the problem of estimating the 
difference between the means of two normal populations will be reduced to that 
of estimating the mean of one population of the u’b. Its solution is given by the 
confidence interval 

u—St( a )^8 St( a ), (15) 

where 8 has an obvious meaning and is to be taken with n — 1 degrees of 
freedom. 

Again, an experiment consisting in repeated sampling of pairs of normal 
populations will show that, whatever be 0, S, cr, <r', whether constant or varying in 
an absolutely arbitrary manner, the relative frequency of cases in which the 
statement about S in the form of (15) will be true will be approximately equal to a. 
The above solution of the problem, elegant as it is, is only a partial one. The results 

* Apart from these, the same author has obtained certain relevant results referring to the case 
where n = = 2 (Bartlett, 1936). 
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of Bartlett do not tell us whether the family of systems of confidence intervals 
found by him exhausts all the possibilities and whether it is possible to construct 
intervals which would be, in one sense or another, shorter than those given by 
(15). These aTe interesting and important problems and we may hope to have 
them solved. 

A result in fiducial theory corresponding to, but not equivalent with, formula 
(15) has been published by Fisher (1936): 


Let us suppose that a sample of n observations has yielded a moan, x, and an estimated 
variance of the mean, a 2 , so that s 2 = S{x t — x) 2 jn(n — 1); then we know that if // Is the mean 
of the population 

/i — x+at, (16) 

where t is distributed in ‘Student’s’ distribution. Similarly, for the mean of a second 
population, of which we have n' observations, we may write 

jl' = x'+8't', (17) 


where t' is distributed in ‘Student’s’ distribution with n' — l degrees of freedom, inde 
pendently of t. If now 

r = 8, x’-x = d, 


we find that 


J —d = s't' — st, 


(18) 

(19) 


and since, s' and s are known, the quantity represented on the right has a known distribu¬ 
tion, though not one which has been fully tabulated. The equation may he written 


e s= 4(s* +s' a ) ( t' cos R — t sin R), (20) 

where tan jR = s/s', so that It is a known angle. If t and V be taken as the co-ordinates of a 
point on a plane, the frequency of the observations falling within any area of tho plane is 
calculable. The points for which e has any given value lie on a straight lino, at a distanoe 
from the origin ± e/(s 2 -f s' 2 ) 1 , and making an angle R with the axis of t. The fiducial prob¬ 
ability that e exceeds any given value is the frequency in the area above this line. If n 
and n' are both increased, the distribution of e tends to be normal and independent of R ; 
when B is 0° or 90° the distribution is of ‘ Student’s ’ form. In general it involves n, n', and 
R and for any chosen probability, therefore, requires a table of triple entry. 

As the reader will notice, no restrictions are mentioned and it is not suggested 
that for the practical application of the results any assumption is needed con¬ 
cerning the variability of the variances of the populations sampled. Neither is 
there any suggestion of any new principle that may he involved. We will return 
to this point below. 

Following the publication of Fisher just quoted, and on his advice, Sukhatme 
published a table (Sukhatme, 1938). The quantity tabled may be denoted by 
f(n, n', R) and represents the root of the equation 


n C + to 

0( i ‘ 


H(t')dt'\dt = 0-025, 


( 21 ) 


where G(t) and If (f') are ‘Student’s’ distributions with n -1 and A -1 degrees of 
freedom respectively, while 

(22) 


K = 


(.s 2 + s' 2 )* cos R 


10-2 
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It follows from the context that/(», n', R) so calculated is the value such that 
the fiducial probability of its being exceeded by | e |/(s 2 -M' 2 )l is equal to 0-05. 
In other words, the values f(n,n', B) are the fiducial 5% limits of | e |/(s s + s' 2 ) i . 
As e = 8-d, if the presumption that the fiducial limits necessarily lead to 
confidence intervals be true then this means that the double inequality 

x'—x -f(n, n', R) fis *+ s' 2 ) ^ 8 ^ x' ~ x +/ (n, n', R ) ,/(s 2 + s' 2 ) (23) 

must be the confidence intervals for 8 — fi~fi. But it is easy to see that the 
functions on the extreme parts of (23) do not satisfy the conditions, explained in 
§3 above, necessary and sufficient for them to be the confidence limits. Take 
5 = 0 and denote simply by A the region in the space of the x’s including all the 
points in which the inequality (23) is satisfied. Take the probability law of the 
x’b and put S = 0 in it, that is, fi ~ //,. It will be seen that the integral 1(A) of this 
probability law taken over A depends on the ratio p — crjc' of the two tr’s appro¬ 
priate to the two populations sampled and, thus, that it does not satisfy the 
identity (S). 

Condition (23) defining the region A does not involve the particular x’s but 
only the means x, x', and the variances s 2 and s' 2 . Consequently, to calculate 
1(A) we may start with the probability law of those four variables 


p(x,x',a,s') ■■ 




x exp 


<rV w ' 

w(a--/t) 2 n‘(x'- 




«(w-l)s 2 n'(fi— l)s' 


2cr a 2cr( 2 2cr a 2</ 2 


■], <«> 


where c is a purely numerical constant and does not involve any of the parameters. 
This function must be integrated over the region A defined by (23) or by the 
equivalent inequality 

^=|L</(n,*',2J). (25) 

In dealing with it, we have to remember that B is not a constant but is connected 
with, s and s' Joy the equation tan B = a/s'. The required integral, or probability, 
of x, x', a, and s' satisfying (25) will be more easily calculated if we introduce a new 
system of variables, u, v, R, and s 0 , These will be connected to the old system as 
follows: 

« = p + us^ainR,' 
x' = p + vs 0 cos B, 
s = s 0 sin B, 
s' = s 0 cos E. 

The Jacobian J of the transformation is easily found to be 

J ~ sg sin R cos R. 


(26) 


ik 


(27) 
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The limits of variation of the new variables are as follows: 


— 00 <u,v< + 00 , 

0 ^5(j) 

0 < R < \n. 

The probability law of the new variables will be 


p(u, v, s,„ R) = —rr-f-,SQ +n '~ 1 er l( ^ 8 o> sin " -1 E cos n '~ 1 B, 

(T fl (T 

... .„ nv? sin 2 R n'v 2 cos 2 jR n(n— 1 ) sin 2 It n'ln’ — 1) cos 2 R 

with ^ a =-s—~ +-+ --v-+ —- k -■ 

cr 2 a 2 cr 2 <r a 

Inequality (26) will be equivalent to 

| v cos B - u sin R | <f(n, n\ R). 


(28) 


(29) 

(30) 

(31) 


As this does not involve s 0 the integration with respect to this variable can be 
carried out within the extreme limits of its variation. As a result further integra¬ 
tions may be performed on the probability law of w,‘v, R, 


I* CO 

p(u, v, R) = I ^ p(u, V, s 0 , R) ds„ 


c sin ” -1 R cos ’ 1 ' -1 R 


ntr'n' 


(T n cr 


jjf-n+n’ 


(32) 


where o is again a numerical constant. 

Further integration may be conveniently carried out as follows. Substitute 
a new variable z for the variable v so that 


v = 


z + u sin R dv 


cos R 


CZ COS R 


(33) 


Keep z constant within the limits | z | ^f(n, n', R) prescribed by (31) and integrate 
for u from - oo to + oo. The result is 


p{z, R) 


c sin u-2 R cos ' 1 ' -2 R 
(jn-lfj-'n'-l ^(no-" 1 + ftV 2 ) 


nn' „ w(n-l) . ._ «/(«' — 1 ) „ 

z 2 H—-— 5 —- sm 2 R + —— 7 ^—- cos 2 R > , (34) 


ntr* + n cr 


'rr2 ' 


The integration is completed by an easy substitution for z 


1(A) = cp n '~ l 
*1 »/ 


-r. 


sin 11-2 R cos ' 1 ' -2 R 


(n(n — 1 ) sin 2 R+n'(n' — 1 ) p 2 cos 2 J 0 (1 + z 2 ) i(?l+ "'' 


p* dz \ D 
'~ 2 >Jo (l + 2 2 )«»+«-'-D| ft ^ ) 


(36) 
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with/=/(», n', R) and 


w 6 


I (•&=}!*? cos’ if). 

ncr 2 + «. cr 2 /\ <x 2 cr 2 / 


(30) 


By inspecting (35) it is more or less evident that 1(A) must depend on the value 
of p, However, to avoid any doubt in this respect, it was thought useful to 
calculate 1(A) for a few values of p. This was done by Miss Elizabeth Scott of the 
Statistical Laboratory, University of California, and it is a pleasure to record the 
author’s indebtedness to her. The calculations involved supplementing the tables 
of Sukhatme for a denser set of values of R. The calculated values of 1(A) are: 


72 . — 12, n' = 6 


p 

1(A) 

0-1 

0-966 

1-0 

0-960 

ion 

0-934 


Thus the functions representing the fiducial limits for 8 do not satisfy the condi¬ 
tions necessary and sufficient for them to be the confidence limits of the parameter 
in question. It follows that if pairs of normal populations forming a long sequence 
are sampled and the extreme parts of the double inequality (23) calculated, then 
the relative frequency of cases where the prediction of the value of 8 by means 
of these inequalities will be correct need not be equal to the expected 0-95. It will 
depend on the value of p and, if this is uncertain, this frequency will be unknown. 
Subsequent comments by Fisher (Fisher, 1939 a) seem to indicate that the fre¬ 
quency in question is expected to approach 0-95 only if the ratio p is not constant 
but follows a certain fiducial distribution. It is noteworthy that no such restric¬ 
tion is to be found in the original work quoted above. On the other hand, it is 
more or less in line with those restrictions formulated by Yates. 


5. Views or M. S. Bartlett and R. A. Fisher 

The controversy in which the main contributors are Bartlett (Bartlett, 1936, 
1939) and Fisher (Fisher, 1937, 1939a, 19396) seems to be based on a mis¬ 
understanding. Presuming that the fiducial limits are always equal to confidence 
limits, Bartlett was puzzled by Fisher’s results concerning 8 just quoted, and 
suspected an error. The subsequent elaborations by Fisher and Yates amount to 
a confirmation that the values of / (n, n !, R) as tabled by Sukhatme do not provide 
the confidence intervals. But both authors are emphatic that there is no error in 
the original deductions, and that Bartlett misunderstood the problem. It is 
unthinkable that these four unanimous papers are mistaken and, therefore, we 
must accept the conclusion that the presumption of intrinsic identity between 
fiducial and confidence limits is unfounded. 

But it must be pointed out that, before the appeal to extra-logical principles 
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was published, there was much to be said in favour cf the opinion that the solution 
of Fisher, as quoted above, and the work of Sukhatme both involved errors in the 
algebra of probability laws. It also seems that, apart from establishing that the 
fiducial theory and the theory of confidence intervals are distinct, it will be of 
some interest to analyse Fisher’s work in detail and to point out exactly where 
and how it diverges from the rules of ordinary theory of probability on which the 
theory of confidence intervals is based. 

When a system of observable phenomena is treated mathematically, it is 
essential to be clear on exactly what is assumed as given or as known. For example, 
when trying to calculate the area of land from a certain set of measurements, it is 
essential to be clear as to assumptions made concerning the shape of the land 
considered. The available data may be consistent with a number of such assump¬ 
tions, e.g. that the surface considered is a plane or that it is spherical with a given 
radius, etc. Whichever of these hypotheses is accepted as given, the applications 
of the appropriate formulae will give mutually consistent results. But they would, 
not generally be consistent if one part of the calculations were made on one 
hypothesis and another on a contradictory one. The differences may be small, but 
in mathematics there are really no ‘ small ’ nor ‘ large ’ inconsistencies. There are 
simply inconsistencies. Needless to say, the choice of exactly what is to be 
accepted as given must be made to attain the greatest conformity with empirical 
facts. But this is a question which need not be discussed here. 

The above general principle also applies to the applications of probability. 
There we must be clear as to exactly what are the phenomena or the variables 
which we agree to consider as random in a given inquiry. In practice, of course, 
the random variable will be the one whose value at the moment is uncertain and 
is being determined ‘by chance’. If X is considered as a random variable, the 
premises of the mathematical problem must include some assumptions as to the 
relative frequencies with which X assumes its possible values. These assumptions 
may vary in specificity, but they must be present in the premises. 

Any number or variable which is not random must be clearly recognized as 
such. For some time such non-random numbers were called constants. This was 
more or less satisfactory with constant numbers. But Frechet (Frechet, 1937) has 
noticed that we may also consider variables which are not random and has in¬ 
vented useful terms to describe them. These are ‘nombre certain’, ‘fonction 
certaine ’, etc. We will translate these terms by ‘ sure number ’ and ‘ sure function ’. 
The thousandth digit in the expansion n = 3-1415... is a sure number, although 
totally unknown to me. Denote by f(n) the relative frequency of 0’s among the 
first n digits of the same expansion of it. This will be a sure function. On the other 
hand, if (pin) denotes the number of errors that may be made when calculating tt 
to n places of decimals, then <p{n) may be considered as a random function of n. 
Considerations of this kind would imply those of a considerable sequence S of 
similar attempts to calculate tt, by the same person or by different persons of a 
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specified category, in which the values of <f>(n) will vary, as we shall say, at random. 
It is with respect to just such a sequence of determinations of the values of the 
function <j>{n) that our probability statements will refer. For example, if we either 
start or finish our calculations with the probability equal to 0-25 of §5(n) being 
between any two sure numbers a, b, then the applicational statement is that about 
25 % of the numbers of the sequence S satisfy the inequality a <b 

It is important to notice that the sequence S may consist of just one member; 
then all the proportions relating this ‘sequence’ will have to be either 0 or 1. In 
other words, if the sequence of ‘random’ determinations consists of just one 
element, this element will have the property of a sure, not a random, object, in the 
usual sense of the word. 

Now let us turn to the passage from Fisher’s paper quoted above, p. 136, and 
try to see exactly what is supposed to be random there and what elements of the 
problem are treated as sure numbers or sure functions. These details in the set-up 
are not stated at the outset, but there is no difficulty in collecting them from 
appropriate passages in the paper. We first see that the function t of (10) is sup¬ 
posed to be ‘distributed in different samples...’. This means that t is a random 
variable and that its randomness depends on what is found in those repeated 
samples, namely, the values of x and s. It follows that the probabilities concerning 
x , s , and t refer to the sequence S of those ‘different ’ samples. The sequence could 
not consist of just one sample because, in such a case, the ‘ distribution’ of t would 
not be anything like ‘Student’s’ law. The references to a normal population 
sampled and to ‘ Student’s ’ law indicate, on the contrary, that the sequence S 
of samples is very large indeed, and that the distributions in it are comparable to 
those represented by continuous curves. 

Up to this time we have, not mentioned the population mean y which is also 
involved in the expression of t. Obviously, this may be treated mathematically 
either as a random or as a sure number. Both methods of approach are at our 
disposal but, in order to avoid inconsistencies, we must be clear as to which one 
we follow. The indication of Fisher’s choice is found a little further on in this 
article, in the place describing the distinction between the fiducial and the inverse 
probability approach: ‘It is of some importance to distinguish such (fiducial) 
probability statement about the value of pi, from those that would be derived by 
the method of inverse probability from any postulated knowledge of the distribu¬ 
tion of y in the different populations which might have been sampled.’ T his 
sentence does not seem to leave any ground for doubt, In the fiducial approach 
we consider but one population sampled and no distribution of y is postulated. 
Therefore, y is a sure number and, if t is distributed according to ‘ Student’s ’ law, 
it is a result of the appropriate variability of x and s alone. 

The symbol t v which also comes into play, is obviously a sure variable capable 
of any real value between — co and + go. We may select it as we wish and then 
obtain the probability of the random variable t exceeding t L from tables. 
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Following the article, we will readily agree with Fisher that the inequality ( 11 ), 
namely, fi<x — stj^jn, is equivalent to t>t 1 and that it must he satisfied with some 
probability P(i t ). Now consider the phrase: ‘ Since, therefore, the right-hand side 
of the inequality (i.e. x — stj.fn) takes, by varying t v all real values, we may state 
the probability that /i is less than any assigned value, or the probability that it lies 
between any, assigned values, or, in short, its probability distribution in the light 
of the sample observed.’ From the point of view of ordinary logic and of ordinary 
theory of probability this phrase is inconsistent with the original set-up. The first 
inconsistency is involved in the words which are italicized, suggesting that % and 
s in the expression x — st x /^Jn are not random but sure numbers, referring to one 
particular observed sample. As a matter of fact this same inconsistency appears 
earlier in the statement that x—stj^jn, by varying t x . will run through all real 
numbers. If, as formerly, x and s are random with their variation appropriate to 
the sequence S, then, whatever value we choose to ascribe to t v say t = 2 , the 
expression x-2s/*Jn is also random and depends on the outcome of sampling. 

Apart from this sudden shift in the meaning ascribed to x and s, there are two 
more inconsistencies. To see the first of them, let us follow Fisher, changing our 
minds about x and s and considering them as sure numbers, determined by one 
particular sample. In this case the inequality /i<x — stjjn would contain no 
random elements at all: the first element, /t, is an unknown constant, the mean of 
a single population sampled, x and s are fixed by the sample observed, and t x 
is the value of the sure variable that we have chosen to consider. In these circum¬ 
stances, the inequality may either be true or not true and the probability of its 
being true will equal unity or zero and have nothing to do with the probability 
or frequency P(tf) which this same inequality satisfies within a sequence 8 of many 
‘different’ samples. 

The last inconsistency refers, of course, to the point of view on [i. As we have 
seen above, it is first considered as a sure number, but the passage just quoted 
speaks of the probability of its lying between any assigned limits possible to 
determine from the values of -P(t). Assume n = 4 and that the sample observed 
gives x = 10 and s = 2. Select t x — 0-765 and t[ = — 0-765 so that P(t x ) = 0-26 and 
P(t' x ) = 0-75. This would result in the supposed probability P‘ of /i lying between 
the limits 9-235 < 10-765, being equal to 1/2. Trying to interpret this result in 

the light of the classical theory of probability, we have to conceive a sequence, 
say S', of cases in 50 % of which fi falls between the above limits. But exactly what 
could this sequence be? Either there is such a sequence and then we must also 
consider other populations ‘ which might have been sampled and postulate some¬ 
thing about the distribution of fi*, or else the ‘ sequence ’ must be the degenerate one 
of one element only with the probability P' equal to either zero or unity, but 
never to 1/2. 

These are the points previously mentioned by the author (Neyman, 1934), 
* This is quite essential. Otherwise there would be an error in Bayes’s theorem. 
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which, from the point of view of classical probability, represent conceptual in¬ 
consistencies. They are also present in the other passage of Fisher quoted on 
p. 139, but a similar analysis of that passage supplemented by what has subse¬ 
quently been done by Sukhatme, will reveal errors in algebra of probability laws 
as well. These errors are particularly relevant from the point of view of the contro¬ 
versies between Bartlett and Fisher. 

The quantities considered in this passage are all dependent on the population 
means /< and /// and on the statistics x and s of one random sample and on x' and 
s' of the other. Our analysis will also require the consideration of the population 
variances <r 2 and <r' 2 . We must start by deciding on the random or sure character 
of all these quantities. Fisher’s remark that the two ratios 


t 


^ and 


(37) 


are distributed according to ‘ Student’s 5 law with appropriate degrees of freedom 
suggests that fi and ft 1 are treated as sure numbers and that x, x', s, and s' are 
random. There is no reference whatever to the variances cr 2 and <r' 2 . As nothing 
is disclosed about what distribution they may possess, by analogy with the y's 
it is natural to treat them as sure numbers also. 

In order to interpret every step in calculations more easily, we shall imagine 
two normal populations Tf t and tt 2 sampled and a sequence A of pairs of samples, 
of n and n- individuals respectively, drawn independently from n 1 and tt 2 . These 
pairs of samples will determine x, s, x', and s', generating distributions appro¬ 
priate to normal populations. Substituted into formulae (37) they will make t and 
t' vary to generate the two distributions of ‘ Student’. 

With this in mind, let us examine the passage in which Fisher writes 

e = 5—d = aY — st, (38) 

and comments: ‘ Since s’ and s are known, the quantity represented on the right 
has a known distribution, though not one which has been fully tabulated, 5 We 
see here the same kind of sudden jump in the point of view on quantities con¬ 
sidered as is found in the passage analysed previously. Formerly s' and s were not 
‘ known 5 but random. Otherwise, the distributions of f and t' would not have been 
those of ‘ Student ’ but would have been normal about zero and due solely to the 
variability of $ and x'. Now s’ and s are known sure numbers. Let us allow for this 
shift in conditions and try to visualize the character of the distribution of e for 
fixed s' and s. F or this purpose we have to consider not the whole sequence A of pairs 
of samples mentioned above, but only a subsequence B composed only of those 
pairs of samples in which the estimated variances have the same values s and s' as 
the ones supposed to be ‘ known The variability of e in the subsequence B will be 
the result of the variability of x and *' only. It is known, that the mean of a sample 
from a normal population is independent of the sample variance. Consequently 
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the distributions of x and x' in B will be normal. As the connexion between e on 
one hand and x and x' on the other is linear with constant coefficients, it would 
follow that the distribution of e in B would be normal also. Therefore, it is with 
some surprise that one reads Fisher’s suggestion that this distribution has not 
been fully tabulated. Evidently, when writing the sentence quoted, Fisher had 
something else in mind, probably depending on the new extra-logical principle 
described in subsequent publications. However this may be, we have to note the 
conflict between the sentence quoted and the rules of ordinary logic and of the 
classical theory of probability. 

The distribution of e by itself does not play any further role in Fisher’s work. 
Instead he and, subsequently, Sukhatme consider the ratio that we will denote by 
z■— e/^j(s 2 +s' 2 ). Fisher does not write any formula representing the supposed 
distribution of z and we have to look for the details of his ideas in Sukhatme’s 
paper. Complimentary references to this paper in subsequent publications by 
Fisher suggest that it is perfectly in line with his own ideas. We quote the relevant 
sentence in Sukhatme’s paper, only altering his notation to bring it into agreement 
with that of Fisher. 


He (Fisher) considers the distribution of 


a 


Visas'*) 


cos B — t sin B , 


(39) 


for given n, n', and B in order to obtain the probability that z exceeds any given 
value. 


It is obvious at once that the probability in question does not refer to either of 
the sequences A or £ visualized above. The appropriate sequence 0 of pairs of 
samples to which this probability refers is a part of the sequence A composed of 
all such pairs of samples in which the variances s 2 and s' 2 , while variable, keep the 
ratio s/s' = tan R = constant. Mathematically, the distribution sought is known 
as the relative distribution law of z given F and is denoted by p{z\ R). If p(R) and 
p(z, R) are the absolute probability law of R and the absolute joint probability 
law of z and R, respectively, then, for every R such that p(R) > 0, 


p(z | R) = 


P(2> R) 

p(R) ■ 


(40) 


The relative probability, given R, of z exceeding a fixed number z v that is 
P(z >z x \R), will be obtained by integrating (40) for z from z x to + oo. There is an 
alternative way of obtaining the same probability. This consists of first finding the 
relative joint probability law given R of t and t'. If this is denoted by p{t, t' j R) 
then 

P{z > z x | R} - f f p(t, t' | R) dt dt', 

J J W(Zi) 


(41) 
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where the region of integration w(z 1 ) is determined by the inequality 

z = f'cos R-tsinR>z v (42) 


A familiar formula gives 


p(t, i’ | R) 


p(R) 


(43) 


Whichever way, (40) or (43), is preferred, the resulting probability P{z > z x | R} 
will have the same value and will refer to the sequence C described above. 

Sukhatme has chosen to apply a quadrature procedure to calculate the integral 
(41) with the integrand equal to the product of two of ‘ Student’s ’ distributions with 
n - 1 and n‘ - 1 degrees of freedom respectively. This is just the error in algebra of 
probability laws mentioned above. The t and t' are distributed independently and 
in accordance with ‘ Student’s ’ laws only in the sequence A where both the means 
H and x' and also the variances s 2 and s' 2 are undisturbed in their random and 
independent variation appropriate to samples from normal populations. When 
calculating the probability ‘for a given R’, we do not consider the sequence A 
but only its part C so selected that the ratio sjs' is constant. This selection disturbs 
the original distribution of a and s' and is reflected in the resulting joint distribu ¬ 
tion of t and t'. 

In our calculations above (26) we have used the letters u and v for what is here 
denoted by t and t'. Consequently, the joint probability law p(t, t\ R) is obtained 
from (32) by merely substituting t for u and t' for v. The absolute probability law 
of R is easily obtained by integrating (34) with respect to z between the limits 
-oo and +oo. The result is 


„ M ,_, sin ” -2 R eos n '“ 2 R 

2 H-K) - c P _ 1 ) sin 2 R + n'{n' — 1 ) p 3 cos 2 _ft}K*+ n '- 2 > ’ 


(44) 


with c denoting a numerical constant. Substituting (32) and (44) into (43) we 
obtain 


p(t,t' | R) 


_ m,p) _ 

{ n(t 2 + n — 1) sin 2 R + n' (f ' 2 + n' — 1) p 2 cos 2 ’ 


(45) 


with <f){R,p) denoting a function of R,p,n and n' only. p(t, t' j R) is just the func¬ 
tion to be integrated to obtain the relative probability given R of t and t' to verify 
any inequality such as t' cosR-tain R>z v As one would expect p(t,t'\R) 
appears to depend not only on R but also on the ratio of the population vari¬ 
ances p 2 . 

It follows that, from the point of view of the ordinary theory of probability, 
the Pisher-Sukhatme solution is wrong. The error consists in their confusing the 
absolute probability law of t and obtainable by integrating (32) for R, with the 
relative probability law given R of the same variables as given by ( 45 ). Some such 
error seems to have been suspected by Bartlett. Repeated denials and the re¬ 
ference to the extra-logical principle underlying the fiducial theory lead us to 
believe that from the point of view of that particular theory the error is non- 
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existent. While accepting these explanations we may still regret that the earlier 
papers by Fisher and that of Sukhatnae do not contain any clue as to how they 
are to be interpreted. 


6; Summary 

1. The theories of fiducial argument and of confidence intervals differ in their 
basic conceptions. The validity of the former requires, at least in some oases, the 
fulfilment of various restrictions of which the theory of confidence intervals is 
totally free, and/or the acceptance of some new principles impossible to deduce by 
the rules of ordinary logic (Yates, 1939; Fisher, 19396). 

2. The two theories may occasionally give the same numerical results in the 
form of fiducial limits on one side and of confidence limits on the other. The pro¬ 
blem of estimating the difference of means of two unknown normal populations 
shows, however, that this need not always be the case and that fiducial limits need 
not satisfy the definition of confidence limits. 

3. Bartlett’s criticisms of Fisher’s solution of the problem just mentioned 
seem to be due to his considering the problem from the point of view of ordinary 
theory of probability and ordinary logic. In this light Fisher’s solution does 
contain both conceptual misunderstandings (originally pointed out in the author’s 
paper of 1934) inherent in the very concept of fiducial distribution of a parameter, 
and errors in algebra of probability laws. Since the first references to the new 
principles outside of ordinary logic, which supposedly justify the fiducial theory, 
were published after the publication of Bartlett’s criticisms, the latter seem to he 
perfectly justified and useful. 

4. Owing to a certain flaw in the ideas underlying the fiducial theory which is 
noticeable in passages quoted in §4, it is impossible to insist on any definite 
attitude towards it, except that of doubt. It may be useful, however, to express 
the following conjectures which seem to be very probable. If they are wrong then 
they will be put right and, as a result, the situation will he clarified. 

The present author is inclined to think that the literature on the theory of 
fiducial argument was born out of ideas similar to those underlying the theory of 
confidence intervals. These ideas, however, seem to have been too vague to 
crystallize into a mathematical theory. Instead they resulted in misconceptions 
of ‘ fiducial probability ’ and 1 fiducial distribution of a parameter ’ which seem to 
involve intrinsic inconsistencies as described in § 5. In this light, the theory of 
fiducial inference is simply non-existent in the same sense as, for example, a theory 
of numbers defined by mutually contradictory definitions. 

In earlier stages when-the problems treated were very simple, the fallacy 
involved in ‘fiducial probability’ was not apparent. Later on, however, diffi¬ 
culties appeared and the new principle ‘which cannot be deduced by logic’ seems 
to have been invented to disentangle them in one particular case. But the word, 
‘principle’ implies some.generality, hence the drift in comments on the samfe 
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subjects treated in 1936 and again in 1939. From the point of view of the direction 
of this drift it is perhaps significant that Yates speaks of ‘ fiducial statements 
possible to make on the ground of probabilities a posteriori and that the paper by 
Jeffreys which professes the equivalence of fiducial theory with that of inverse 
probability appeared in the Annals of Eugenics , edited by R. A. Fisher. 

However this may be, the only thing that the present author ventures to 
profess is that the theory of fiducial probability is distinct from that of confidence 
intervals. 
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The Incomplete Beta Eunction ratio has been defined as 


4 (JP.S) = B x (P><l)IB(p,q) 


B(p + q) U 1( 

'mn j)J.* ( m) 


(i) 


When the fundamental Tables of the Incomplete Beta-function (6) were pub¬ 
lished by Karl Pearson in 1934 it was realized that they might form a basis for 
shorter tables suited for use in special problems. One such application is in con¬ 
nexion with sampling theory and the associated significance tests. In carrying 
out these tests it is generally considered that a table giving values of the argument 
corresponding to certain convenient probability levels is more useful than one 
in which the probability integral is listed at equal intervals of the argument. 
Using the transformed variable 


2 = Uog, 


g(l-s) 

qx ’ 


( 2 ) 


R. A. Fisher (3) was the first to provide tables of this character, giving values of 
z associated with the 005 and 0*01 probability levels. Since then, a table for the 
0-001 level has been calculated by Colcord & Deming(2) and one for the 0-20 level 
by H. W. Norton (in Tables edited by Fisher & Yates (4)). In the terminology of 
the analysis of variance, if 

S t is a sum of squares depending on v x degrees of freedom, and 
S 2 is a sum of squares depending on v 2 degrees of freedom, 

then x = 8^S 2 (3) 

and = 2q, v 2 = 2 p. (4) 
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For tests of significance, z is easily computed and the fact that, when v l and 
v % are large, it tends to be normally distributed about zero with a standard 
deviation in / i i \ \ 

( 6 ) 




lent considerable weight to its tabulation rather than that of x. Experience has, 
however, shown that in a number of problems it is the percentage levels of * of 
the Beta-distribution that are directly required; this fact and the desirability of 
having available a greater number of percentage levels* are reasons for the issue 
of the present tables giving five significant figures for x. They may be regarded 
as a supplement to the original 1934 Tables [Q). 

Conversion from x to -z or to Snedecor’sC?) F, where 


F = = 


( 6 ) 


is straightforward. Tables of the seven percentage levels for F have, in fact, 
been already computed, and it is hoped to. include them in a new edition of 
Tables for Statisticians and Biometricians. 

Since the completion of the marginal columns for the tables of F involved some 
fresh computation, it seemed useful to extend the work so as to provide new 
tables giving thirteen percentage levels for yf. These tables are printed in a 
separate contribution on pp. 187-191 below; they have been calculated to six 
significant figures and cover the range of degrees of freedom v — 1(1)30 and 
40(10)100. 

A word of comment is perhaps desirable as to the introduction of the notation 
v, and for degrees of freedom in place of the customary n, n x and n 2 . The use 
of the letter n, both with and without a subscript, to denote a group frequency has 
been so long established in publications associated with Biometrika and elsewhere 
that it seemed desirable in these tables to avoid confusion by adopting a fresh 
symbol for degrees of freedom. The letter / has sometimes been used, but the 
notation is not altogether satisfactory; the letter v is that employed by Yule & 
Kendall ((8), p. 415), and its use here should be free from any ambiguity. 

Reference has been made above to the existence of problems where the direct 
requirement is for the percentage levels of x rather than 2 or F. A case in point is 
that of the multiple correlation coefficient in samples from uncorrelated normally 
distributed material; here B 2 follows exactly the Beta distribution. In other 
cases the distribution may be used to give an approximate fit to probability 
functions whose exact equations are either unknown or difficult to handle. Thus 
in his Preface to the Tables of the Incomplete Beta-function , Karl Pearson stated 
that his first interest in the function was stimulated by the discovery of how 
accurately it could be made to graduate a hypergeometric distribution. The 

* The percentage levels tabulated are: 50, 25, 10, 6, 2-5, 1 and 0-5. 
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fitting was carried out by equating the first four moments of the Beta and hyper¬ 
geometric distributions. 

Again, if a random variable can assume only values between 0 and 1, if it has 
a mean value of fi' x and a second moment about zero of //.(>, then the probability law 

= T(pj7W) ~ a5)4-1 ’ (7) 

where p = /*?), Q = (l“/4)(/4-/4)/(/4“/ti a ), ( 8 ) 

will often give a very close approximation to the true law. Use has been made of 
this fact by Neyman & Pearson (5), Bishop (i) and others in determining prob¬ 
ability levels for the likelihood ratio criterion L x used in testing the homogeneity 
of a series of variances and covariances. The accompanying tables are directly 
applicable in such problems. 

Miss Catherine M. Thompson (now Mrs V. G. Grylls) has been responsible for 
by far the greater part of the numerical work involved in the production, and the 
tables should rightly be associated with her name. Owing to the special character 
of the Beta-distribution, which makes it necessary to vary the method of com¬ 
putation in different parts of the range of variables covered, a considerable 
amount of exploratory work and some careful planning was needed in the 
development of the lines of attack. Tins essential aid has been provided by 
Drs L. J. .Comrie and H. 0. Hartley of Scientific Computing Service Ltd., in 
whose office Miss Thompson carried out.most of the work. Since the evacuation 
of University College at the beginning of the war this help, both in advice and in 
accommodation, has been more than ever essential. In the following pages 
Drs Comrie and Hartley have described the various methods used in computation 
and have also discussed the problem of interpolation. 

The Editor is glad to take this opportunity of expressing his warm appreeia-' 
tion of this collaboration, which has made it possible to carry through to a suc¬ 
cessful conclusion a piece of work that had long been in view. 
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Percentage points of the incomplete beta-function 

DESCRIPTION OP THE CALCULATION 
By L. J, COMRIE and H. 0. HARTLEY 


Introduction 

In terms of Karl Pearson’s notation the incomplete Beta-function B x (p, q) is 
defined by the integral 

B x (p,q) ~ J (1) 

For a: = 1 we have the complete Beta-function B x (p, q), commonly denoted by 
B(p, q) and defined by the equation 


B{p,q) 


r(p)r(q ) 


( 2 ) 


P(f + q) ’ 

which is identical with (I) for x = I. 

The tables give the percentage points of the ‘normalized’ incomplete Beta- 
function Plv + a) f® 


(3) 


They are defined as the roots x of the equation 

4(i>,«) = -P (*) 

fox given P, asfunctions of the parameters p and q. Seven tables have been pre¬ 
pared corresponding to seven selected values of P, namely 0-005, 0-01, 0-025, 
0-05, 0-10, 0-25 and 0-50. From the formula 


( 5 ) 

the roots of (4) follow immediately for P = 0-75, 0-90, 0-95, 0-975, 0-99 and 
0-995. 

In each table x is tabulated for 


v x = 2$f = 1(1)10, 12, 15, 20, 24, 30, 40, 60, 120 and oo 
v 2 = 2p = 1(1)30, 40, 60, 120 and oo 

With 2g as column heading and 2 p as row headings the arrangement of the 
tables corresponds to that of the upper percentage points of R. A. Fisher’s z and 
G. Snedecor’s F, p l and v 2 being the degrees of freedom. 

Karl Pearson in his introduction to the Tables of the Incomplete Beta-f unction (0) 
says: ‘No single method has hitherto,been discovered for evaluating numerically 
the incomplete Beta-function for all values of p and q.’ Those who have done 
numerical work on this function and its various transformations will agree that 
the main difficulty is the limitation in scope of any single method and the variety 
of methods required to deal appropriately with the range of the parameters p 
and q and of the variable a:. This difficulty is enhanced when the task is the 
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calculation of percentage points of x rather than the tabulation of the function 
- 4 (PiQ)' A large number of numerical processes, each specially designed to deal 
with certain ranges of the new tables, had to be employed to accomplish this task. 

The choice of a suitable method is largely determined by three important 
factors: 

(a) The accuracy required for x. This was fixed at five significant figures. 

(b) Existing tables available as a starting point. These are: Karl Pearson’s 
Tables of the Incomplete Beta-function^). Fisher’s tables of percentage points 
of z[i), and corresponding tables of its transformation F(S), arid finally Karl 
Pearson’s Tables of the Incomplete T-function {5). 

(c) The number and relative position of percentage levels, P, and the values of 
p and q for which the percentage points x are to be calculated. 

The importance of (a) and ( b) is obvious, but (c) is no less relevant. It will, as 
a rule, be uneconomical to produce an interpolable table of I x [p, q) merely to 
obtain a single percentage point by inverse interpolation. If, however, a larger 
number of percentage levels is to be calculated the method becomes worth while. 
In this connexion it should be remarked that the original plan was to produce 
tables for P = 0-005, 0-01, 0-05 and 0-10 only, and that it was decided at a later 
stage to add tables for the remaining values of P. 

Summary or numerical methods employed 

( 1 ) Inverse interpolation in Karl Pearson’s tables. A large number of per¬ 
centage points were obtained by inverse interpolation in the tables of I x {p, <f). 
The particular tables required were differenced on a National machine ( 2 ) and 
six significant figures of x found by the method of inverse interpolation described 
by L. J. Comrie( 2 ), taking into consideration the higher order differences. The 
method breaks down when for large p and small q (or for large q and small p) the 
tabular interval of 0-01 is too wide for the tables to be interpolable. With the 
notation adopted, the difficulty arises in the top right-hand corner of the tables 
when for large q and small p the root x is smaller than 0-05. Roughly speaking, 
roots greater than 0-05 could be obtained from Pearson’s tables. 

(2) Interpolation in tables of percentage points. It will be noted that in the 
present tables percentage points are given for certain values of p and q for which 
the function I x (p, q) has not been tabulated by Pearson. Such points occur in the 
rows 2 p = 23(2)29, 120 and in the column 2 q = 120. Whilst the calculation of the 
two marginal lines ( 2 q = 120 and 2 p = 120 ) necessitated special methods, the 
extra entries in the interior of the table were easily obtained by p-wise inter¬ 
polation, using suitable formulae of the Lagrangian type. 

( 3 ) Extension of the tables ofI x (p, q). The well-known recurrence'formula 

I x (p,q) = xl x (p-l,q) + (l~x)l x {p,q-l) (6) 

is particularly convenient if I x {p,q) is required for a lattice work of integer 

11-2 
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co-ordinates 2p, 2 q in a certain range and for a limited range of x. For small p 
and large or moderate q all percentage points of q) are clustered near 0. It 
appeared worth, while, therefore, to use the recurrence formula (6) to extend 
Pearson’s table by producing 

Up,q) for 2p = 1(1)7, 2q = 1(1)30 
and £ = 0*001(0-001)0-012, 0-015 and 0-025 

in order to obtain more of the required percentage levels x by inverse interpolation 
at intervals 0-005 and 0-001 respectively.* This made it possible to obtain a 
large number of percentage points a; in a range where Pearson’s tables are not 
interpolable. 

To start the recurrence process, the functions 


4(i3)> 4(P>i)» 4(1,3) and I x (p, 1) 
are required for the above ranges of p and q. The first two functions were obtained 
from the expansions 


4 ( 1 . ?) 



2(y~l), ,2 (g-l)( g -2) 

3 1! 33 + 5 2! * 


2 (g- 1) (g-2) (g-3) . 

7 3! + 


( 7 ) 


, , 1 lap 1.3£»+ 2 1.3.6a;»+ 3 } 

<c\PA)~ B ( pj ^)( i , + 2 . l! ( i , + 1 ) + 2 2 .2 ! (p + 2) + 2 8 . 3! (p + 3) + ''‘)' ' > 


In dealing with the expansion of 4(i>3)> all terms required for 10-decimal 
accuracy in / 0 . 02B (|, q) were first produced and then reduced for smaller values of 
x by multiplying by the appropriate power of a/0-025. The quantities 


4(b3) * l-i(l-x)«, 4(3>,1) = ^ 

were produced with the help of logarithmic tables. The remaining functions 

4(1-5,8), 4(2-5,3), 4(^-5,3) and 4( 2 ,3), 4 ( 3 ,3) 

were then obtained by four recurrences covering the following combinations of 
the parameters 2 p and 2 q. 

Odd values of 2 p and odd values of 2 q 

Even „ odd „ 

Odd „ even „ 

' Even ,, even „ 

Having thus dealt with the main body of the tables we now turn to the more 
difficult problem of calculating entries x near the margin of each table of per¬ 
centage points. In what follows, methods will differ according to whether 2 p is 
odd or even, 


* [It is hoped that at a future date it 1 -> 
4(2>> l) as a supplement to the Tables of the . 


" " di these extended values of 
Ed.] 
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(4) Building up the polynomial part of I x (p,q) from a constant high*order 
difference ( 2 p even). For integer p, 

B x(p>q) = 

may be expressed as a polynomial in (1 - a;). We have 

B (P> 2) ~ B a (p, q) = (l-a/s (- f Or 1 , (10) 

i=o q+% 

or, introducing y = 1 — x, 

B v { q , P ) = £(-)* Or 1 —-- (11) 

{=0 2 +» 



B v (q, p) is therefore the product of the gth power of y and a simple polynomial in y. 
If p is small ( 2 p^ 12 ) this polynomial can be built up easily on the National 
machine (2). As an example, for 2q = 60 and 2p = 6 we have the equation 
14880{U(3, 30)- B x (3, 30)} = 14880^(30, 3) = y 30 (496-960y + 465y 2 ). 

The polynomial 496 — 960y + 466i/ 2 was built up on the National machine from 
its constant second difference for values of y beginning at y = 1 and descending 
at interval 0-001. The polynomial values were multiplied by the 30th power of 
the argument and the products checked by differencing. Finally the percentage 
points y (or x) were found by inverse interpolation. This method was used for 
2 q = 40, 60, 120 and 2p = 2 , 4, 6 , 8 , 10 , 12 . For larger values of p the building 
up process becomes too laborious. On the other hand, with increasing p the 
percentage points x increase in value, so that for values of 2 p greater than 12 and 
not exceeding 100 results could be obtained from Pearson’s tables by inverse 
interpolation. 

(5) Taylor expansion at approximate percentage point (2p even). It remains, 
therefore, to consider the last column 2 q = 120 for 2p p 14, For small x (i.e. values 
of y in the neighbourhood of 1 ) the terms of the expansion ( 11 ) have to be calculated 
to a very high degree of accuracy, since these terms have alternating signs and 
many significant figures are lost when adding to produce B y (q , p), which is very 
small. Since seven significant figures are required for B y (q, p), in some cases 20 
decimals are required for the terms in ( 11 ), and their computation becomes 
laborious. A method was, therefore, evolved whereby B y {q , p) has to be calculated 
for one single three-decimal argument only. Although the function B y {q, p) is 
difficult to compute, its derivatives are easily calculated. It is, therefore, natural 
to use a Taylor expansion 

B y+ ,Aq,p)-B y (q,p) = hy«- 1 {l-y)r- 1 {l + %h($ : ~- ( 12 ) 


With a known* three-decimal approximation y to the exact percentage point 
y + h the main task consists in the calculation of B y [q, p). This was done from 
formula (11), using tables of powers (9), and a high capacity electric calculating 
* It will be shown later how this approximation was obtained. 
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machine. The correction, h to the approximation y was then easily obtained by 
iteration from equation ( 12 ). With h 0 — 0 the iteration is given by 

AB l[ l +i ^(ir~?4) + ' 


h 


)!+l 


where 


AB = 


[B{p,q)P-B v (q>p )] 


(13) 


(l -y) v ~ x iP' t ' 

Since the numerator, AB, of equation (13) does not vary, the corrections h v h 2 
and A 3 are easily produced in turn, three steps being sufficient in most cases. 
Occasionally the term arising from the third derivative had to be included in. the 
denominator. In this way the percentage points were calculated for 


2p = 14(2)22 2 q = 120 , 

2 q = 14(2)22 2p = 120, 

the values for 2 q= 14, 16, 18, 22 and 2p = 120 being required for checking by 
differencing. 

A word has to be added concerning the three-decimal approximation to the 
percentage points of x for 2q — 120 and 2 p = 120. In some cases such values 
could be obtained from Fisher’s table of percentage points of z using the trans¬ 
formation r) 

x = —=—5; ■ 

p + qe u 


In cases where such values are not available they were obtained by harmonic 
interpolation. More precisely, the finite limits 

lim 2qx and lim 2p(l~x) 
q~> oo p-^-oo 

p - constant q — constant 

werefirst obtained from the functions I(u,p) of Tables of the Incomplete T-f 'unction. 
The above limits depend on p and q and-are given by 2 u fp and 2u fq respectively, 
where u is the root of I(u,p~ l) = P and I(u, q- 1 ) = 1 - P respectively. The 
quantity 2 qx, being known for the arguments 1/2 q = 0 / 120 , 2 / 120 , 3 / 120 , 4 / 120 , 
6/120 and 6 / 120 , was then obtained (to about four-deciinal accuracy) for the 
argument 1 }2q = 1/120 by a Lagrangian interpolation formula. Similarly the 
quantity 2 jp(l ~x) was calculated for l/2p — 1 / 120 . 

We are left, therefore, to consider the entries in the column 2q = 120 with 
2 .P °dd, and in the row 2p = 120 with 2 q odd, and also certain entries in the top 
right-hand corner for 2 q > 30 and odd 2p < 13. 

( 6 ) Binomial expansion with fractional index (2 p odd). In the top right-hand 
corner values of x are small and it is therefore to be expected that the expansion 

£(2M)P ==£{-)* Of- 1 — 
i =0 P + l 

is reasonably convergent for such values of x. 


(14) 
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The coefficients of the expansion (14) were calculated for 2 q = 1(1)30, 40, 60 
and 120, and 2 p = 1(1)9, The root x of the equation (14) was then found by a 
suitable iteration process. 

In some cases (2 p = 1,1 4.2q ^30) it was found convenient to invert the ■ 
expansion (14). With 2p — 1 the equation (14), if regarded as an expansion in 
powers of \jx, may be reversed to yield *Jx as an expansion in powers of B(p, q) x P. 
Because of the particular importance of the case 2p = 1 (the ^-distribution) it is 
of interest to give here examples of formulas from which any percentage point x 
may be obtained directly by substituting the corresponding percentage level P. 
If D = \B(p,q)P = lB x (p,q), the first five terms of the reversed expansion 
are as follows: 

*Jx = DDS+ l l^Z 2)5 
o ou 

, (q— 1) (127g 2 — 131g+ 34) . 

* 630 

(q - 1) (4369g 3 - 6285 ? 2 + 3042? - 496) 

+ 22680 D + -’ 

from which the expansion for any particular q may be worked out without 
difficulty. Thus for q — 10, 

V* = D+W 3 + 19-8D 6 + 163-2D 7 + 1496-2Z) 9 + ..., 

and for q = 25, 

<]:x m D + 8Z> 3 + 136-8Z> 6 + 2900' 3D 7 +68162-OD 9 +.... 

(7) 'Numerical integration. For large p and q, when the integral I x (p,q) 
approaches the normal probability integral, a variety of methods has been 
developed(6,7,9). With mechanical computing aids available, numerical integra¬ 
tion appeared to be the simplest. The integrand x v ~ 1 { 1 — a;)® -1 represents a smooth 
curve and was produced at interval 0*01 with the help of logarithmic tables and 
checked by differencing on the National machine. Numerical integration was 
performed by Gauss’ formula and the integral B x {p,q) checked by differencing. 
Finally, x was obtained by inverse interpolation and checked by the application 
of Taylor’s expansion at the tabular value nearest to x. This method was used for 
2 q = 120 and.2 p = 24, 26, 28, 30, 40, 60 and 120 and also for 2p = 120 and 
2 q = 24, 30, 40 and 60. 

For 2q = 120 and 2p = 7(2)29 all percentage points were obtained by p-wise 
interpolation between the entries 2 p = 2(2)30, 40 and 60, using appropriate 
formulae of the Lagrangian type. 

(8) Approximation by ihe incomplete r-function. It remains to consider the 

entries for 2p = 120 and 2 q = 1(2)9. 

There appears to be a lack of suitable methods for obtaining accurate results in 
this range of q for isolated large values of p. Three-decimal approximations x 0 to 
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the percentage levels may be obtained by harmonic interpolation, as described 
in §(5) * To obtain the correct percentage points (a; = x 0 + h), the main task 
consists in calculating 4,(60, q) to six places of decimals. This was done with the 
help of a recently developed approximate formula giving 4(ib r l) terms of the 
incomplete T-function.f This formula is akin to a Taylor expansion of l x [p, q) 
in powers of l/2jp at l/%> = 0 (2p = oo) and may be written as follows'. 

J i (^(^ + (2p)2 + (2p) a+ ' , '|’ ^ 


p{l~x) „ ., p(l~X) 

where u =» ——— and A = ——- 

xfq x 

and the terms r I\ are dependent on A and q only, the first two being 

T t = q~ 1-A 

4a 8 —faf*+te-t+(—i:)A+(te-4)A«-4A* 

This formula is very accurate for large p and small or moderate q. The terms T t 
and T\, were calculated in each case for 


A = An 




where « 0 denotes a suitable three-decimal approximation to the true percentage 
point. To obtain T s we make use of the fact that equation (15) should yield 
•4(50, q) to seven-decimal accuracy. Since 4(50, q) is obtained to that accuracy 
from Pearson’s table and since T v and T 3 depend on A and q only, we may use 

7) 

equation (15) to determine T 3 by substituting p = 50, A *= A 0 , x = x 1 = t-, 

Ao +P 

% ' ■. With T v 1\ and T 3 computed, 4 o (6Q, q) is easily obtained from 

*iV2 

equation (15). Finally the exact percentage point aj 0 + h is calculated by the 
iteration process (13). 

Checks 


The main body of each table of percentage points (i.e. the interior of each table) 
was checked by differencing p-wise and g-wise at interval For large q and 
moderate p, x may be considered as a function of l/2g arid differenced at interval 
1/120, i.e. for 1/2 q = 0/120 (1/120) 6/120. Similarly a; maybe differenced for large 
pandmoderatevaluesofgfor l/2p = 0/120(1/120) 6/120. Four significant figures 
may be checked in this way, thus eliminating any possibility of serious errors. 
For small p and large q, the quantity * is almost linear in 1 j2q so that a good check 
was given by examination of the product 2 qx, which is almost constant. Never- 

* In this case it was sufficient to use a Lagrangian formula for interpolation between values of * 
(with » = 1 for 2p => ®), instead of performing the more complicated interpolation between values 
oF2p(l-*). 

t 1 he derivation of this formula is given in a paper by H. O. Hartley which will, it is hoped, 
be published in the next issue of Biometrika. 
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theless, the only available check to guarantee five-decimal accuracy at the margins 
was recomputation. As far as possible repetition of the method employed in the 
first instance was avoided. Thus inverse interpolation was replaced by direct 
interpolation or by a Taylor expansion at a tabular value; iteration processes 
were varied in the formulae employed. 


METHODS OP INTERPOLATION 
By H. 0. HARTLEY 
Introduction 

In so far as the table is required in connexion with standard tests of significance 
the user will be concerned with obtaining x for any percentage level P and for 
integer values of iq = 2 p and iq = 2 q. 

The values of P and the row and column headings (2 p, 2q) have been selected 
in such a way that the user will generally find the required value of x tabulated. 
Moreover, for most of the applications it. suffices to estimate roughly the magni¬ 
tude of interpolates from an inspection of the table. In some cases, however, 
interpolation to about five-decimal accuracy is necessary. The problem of inter¬ 
polating between corresponding entries in different tables of percentage points 
(interpolation P-wise) will, it is hoped, be dealt with elsewhere and we are here 
only concerned with interpolation in each individual table of percentage points 
x(2p, 2 q) to find x for any combination of integer arguments 2p, 2q* 

In the present tables interpolation to integer arguments v 2 = 2 p, iq = 2 q 
will occur in three different forms: 

(1) Single-entry interpolation g-wise in the range 1 ^ 2p ^ 30 and 10 < 2q < oo. 

(2) Single-entry interpolation p-wise in the range 1 ^ 2q < 10 and 30 < 2 p< oo. 

(3) Double-entry interpolation for 30 ^ 2 p, 10 ^ 2 q. 

If both 2 p and 2 q are large, interpolation in the tables is impractical, and it 
Was therefore necessary to add a fourth section, namely: 

(4) Approximate calculation of percentage points if both p and. q are large. 

It will be noted that, following the lay-out adopted in other tables of per¬ 
centage points (3, 4), the column headings jq = 2q => 20, 24,. 30, 40, 60, 120 
and oo are in harmonic progression. If, therefore, 1/2 q is used as a variable, 
these columns form a tabulation of x at equidistant intervals of the variable 
l/2q. The same harmonic progression is given for the row headings v 2 = 2 p, 
although here the tabulation at unit interval goes up to 2 p = 30, because of 

* Fractional values of 2p and 2q oocur in a number of applications when the percentage points 
of certain Pearson-type curves are required. Hi such cases it will be found most convenient to 
apply single-entry interpolation formulae, first in one direotion {p or q) and then in the other. 
Methods akin to those given here cover the range 2q > 10, 2 p > 30. For the range 0 ^ 2p < 30, 
0<2j<10, successive single-entry interpolation (p-wise or g-wise) at unit interval should afford 
no difficulty provided the arguments of the interpolate do not lie within the strips 0<2p<3, 
0 s; 2q < 2. Within these strips interpolation cannot be carried out without the aid of auxiliary tables. 
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the importance of these values for certain tests of significance. The use of the 
variables 1/2 q and l/2p greatly facilitates interpolation .but even with this 
device (known as harmonic interpolation) high-order interpolation fomrake 
have to be used in many cases, if the accuracy of the tabular values is required. 

To facilitate single-entry interpolation, therefore, an auxiliary table of Lagran- 
gian coefficients has been prepared. Although this auxiliary table has been 
specifically designed to meet the requirements of the present tables of percentage 
points of x, it is given and described in a separate paper (p. 183) since it is felt 
that it will have a wider application to any table of percentage points with a 
similar lay-out. 


1. Single-entry interpolation q'-wise 

No interpolation is required for the range 1 ^2g< 10(1 < 10). Tor 

v i - 2y> 10 interpolates are obtained with the help of the auxiliary table on 
pp. 183-5 of this issue, and its use is best explained in terms of an example. 
Example 1. Find the 5 % point corresponding to 2 p — 26, 2 q = 74. 

In the auxiliary table (p. 185 below) enter the row headed 74, that is, the row 
whose heading is equal to the value of 2 q for which the interpolate is required. 
The entries in this row are the (Lagrangian) multipliers in a sum of products 
which yields the interpolate x, The corresponding multiplicands are taken from 
the table of 5 % points x (2p. 2q). We enter the row headed ip ~ 26 and select 
entries *(26, 2q) for 2 q — 20, 24, 30, 40, 60, 120 and go. These correspond to the 
column headings in the auxiliary table. The sign of each product is also given at 
the top of the columns. We have, therefore, 

*(26,74) = +0-395 16x0-005 867-0-357 56x0-045 623 + 0-313 14x0-162 013 
-0-259 66x0-372 737 + 0-193 79 X 1-018 370 + 0-110 24x0-247 951 
= 0-164 64. 

This may be compared with the exact value 0-164 637 obtained by inverse 
interpolation from Pearson’s tables. 

The accuracy of the interpolates depends on 2 p and 2q and (to a lesser extent) 
on the percentage level P. In favourable cases, if 2 p is small and 2 q moderate, 
the interpolate is accurate to 5 places of decimals. In the least favourable cases, 
for 2 p near 30 or 2 q large, the fifth decimal of the interpolate may he in error. One 
more example is given to demonstrate the use of the auxiliary table. 

Example 2. Find the 50 % point for 2p = 11, 2q = 17, 

Entering the row 17 in the auxiliary table and the row 2p = 11 in the table 
of 50 % points we have 

*(U> 17) = +0-525 38 x 0-003 097-0-476 96 x 0-037 459 + 0-419 02 x 0-409 711 
+ 0-348 45x1-213 958-0-307 07x0-856 212 + 0-260 64x0-315 162 
-0-208 18x0-048 257 
= 0-387 62. 
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It will be noted that for 2q = 16, 17 , 18 and 19 two alternative rows are given in 
the auxiliary table, one (which has been used in the above example) is under the 
heading ‘Harmonic’ Lagrangian coefficients. The other row contains ‘Ordinary’ 
Lagrangian coefficients. It is in this range of 2q that there is little to choose 
between the merits of ordinary and harmonic interpolation, and the use of both 
methods provides a good check. Reworking the above example and using ordinary 
Lagrangian coefficients we have 

*(11,17) = -0-553 46x0-306 397 + 0-525 38x0-780 000-0-476 96x0-983 025 
+ 0-419 02 x 1-258 272 + 0-348 45 x 0-289 546-0-307 07 x 0-040 124 
+ 0-260 64x0-001 728 
= 0-387 62. 

There is satisfactory agreement between the two interpolates and the exact 
value (obtained from Pearson’s table) which is 0-387 619. 


2. Single-entry interpolation p- wise 

No interpolation is required for 1 < 2p < 30 (1 < v 2 ^ 30). For v 3 ~ 2p> 30 we 
again use the auxiliary table onpp. 184,185 of this issue. This time, however, the 
argument 2 p of the interpolate determines the row to be entered v in the auxiliary 
table, whilst column headings of this table are made to correspond to selected 
rows in the table of percentage points. The method is best explained by the 
following examples. 

Example 3. Find the 0-5 % point corresponding to 2q = 4 and 2 p = 96. In 
the auxiliary table enter the row headed 96, The entries in this row are the 
Lagrangian multipliers. The corresponding multiplicands are taken from the 
table of 0-5 % points. We enter the column headed 2q = 4 and select entries 
x(2p, 4) for 2 p = 20, 24, 30, 40, 60, 120 and oo which correspond to the column 
heading in the auxiliary table. 

The sign of each product is also given at the top of the columns. We therefore 
have 

*(96,4) = +0-491 44x0-005 875-0-550 98x 0-044 647 + 0-618 64x0-152 207 
-0-695 71 x 0-318 909 + 0-783 70 x 0-558 090 + 0-884 42 x 0-669 708 
-1-000 00 x 0-022 324 
= 0-857 94, 

which differs by 2 uniife in the fifth decimal from the exact value (0-857 92) 
obtained by inverse interpolation from Pearson’s tables. 
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Example, 4. Find the 5 % point corresponding to 2p — 80 and 2q = 30. We 
have 

3(80, 30) = +0-246 39x0-000 836-0-292 08x0-052 734 + 0-352 00x0-184 570 
- 0-433 21 x 0-410 156 + 0-548 07 x 0-922 851 + 0-720 16 X 0-369 141 
-1-000 00 x 0-020 508 
= 0-624 69. 

The exaot value is 0-624 75. 

Again, the accuracy of the interpolates depends on 2 q, 2 p and to a lesser 
extent on P. Five-decimal accuracy is obtained for small 2 q and moderate 2p> 
whilst only 4 decimals are reliable if 2q is near 30 or 2p is large. 

3. Harmonic double-entry interpolation 
In this section we deal with interpolation in the range 
30^2p<oo, 10<23 <oo, 

provided the arguments of the interpolate are not * too large *. The exact meaning 



Fig-1- 


of this restriction is that if the methods described below are used to find inter¬ 
polates in the range 

60 ^ 2p < oo, 40 < 2q < oo, 

the results obtained are unsatisfactory. In this range the user should, therefore, 
proceed on lines described in section 4. 

The method is essentially double-entry interpolation between points of the 
lattice work shown in Fig. 1. 

The 5, 2J, 1 and £ % values (i.e. the quantities x for P = 0-05, 0-025, 0-01 and 
0-005) are practically linear to three-figure accuraoy in the diagonal direction 
indicated by the broken lines in Fig. 1, 
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To explain the method it will be convenient to regard the percentage points 

x as functions of 60 

V = -v— and £ = —, 

2 p *2q’ 

and to introduce the notation 

x[ti, £] = x(2p, 2q). 

The relation between the argument y, £ and 2p, 2 q is demonstrated in Fig. 1. 
To obtain the interpolate x for any p, q in the above range calculate 


and find 


r 60 , 120 

5 =vr and V = ~n— 

2q 1 2 p 

3 = integral part of £, 

H = integral part of y, 

!> = (£-5) + (?-//)-l. 



Pig. 2. 


If (i is positive consider the parallelogram with vertices at [H, S+ 1], [ H+ 1, 5], 
[// +1,5'+ 1] and [H + 2,5] (see Fig. 2}. Now calculate two interpolates x 1 and 
at points P l and P % from the (approximate) formula? 

/ix[H+2,E] + (l ~p)x[H+l,B] ) 
and finally find the interpolate x[y, £] 

*lv,Q~(Z-5)x l +( i a+ l-fl*,. (17) 

If p is negative the points P 1 and P 2 will be at distance p below the points [H, 5+1 ] 
and [// +1,5'] respectively, and we have the formulae 

** =1,5+1] + (1+/*)*[#, 5+1] 
x a ~ ~px[H,3] + {l+p)x[If+l,3] 
in place of equations (16). 
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Example 5. Find the 1 % point * for 2p = 42, 2 q =' 16. 

We have 

g = 3*75, 9/= 2-8571, £ = 3, if =2, ^= 0-0071. 

To apply formulae (16) the tabular values are taken from the table of 1 % points, 
where we find to 4-decimal accuracy 

£B[3, 4] = *(40,15) = 0-5140, *[4,3] = *(30, 20) = 0-3705, 
x[2, 4] = *(60,15) = 0-6297, *[3, 3] = *(40, 20) = 0-4578. 

Applying the equations (16) we obtain 

aq = 0-6071 x 0-5140 + 0-3929 x 0-6297 = 0-5595 
* 2 = 0-6071 x0-3705 +0-3929x0-4578 = 0-4048, 

Finally we calculate 

*[2-857,3-75] = *(42,16) = 0-75 x 0-5595 + 0-25 x 0-4048 = 0-5208. 

The exact value obtained from Pearson’s table is 0-5163, If higher accuracy is 
required we have to improve the approximate relations (16) by adding the 
second-order difference effect. If this is done we obtain 

aq = 0-5555, aq = 0-4016 and * = 0-5170, 

which agrees satisfactorily with the exact value. If this method is applied to the 
tables of 10, 25 and 50 % points and if a similar precision is required, the second- 
order difference effeot should also be considered when interpolating along the 
diagonals. In such cases the right-hand side of equation (17) should have four 
terms. 

4. CALCULATION OS’ "PERCENTAGE JOINTS IE BOTH f AND q ARE LARGE 

If x is required foT values of 2 p and 2 q in the range 2 p > 60, 2 q > 40, inter¬ 
polation between the tabular values is not possible because of the singularity of 
* at 2p = co, 2 q = co. In this range therefore * has to he calculated ah initio. 
Certain approximate formulae for the incomplete beta-function axe valid in 
this range(8,10). These formulae, whilst useful for a calculation of P = S) 
as functions of *, 2p and 2q, cannot be easily inverted to yield * for the given 
percentage levels P. 


Auxiliary table 

y = normal deviate at level Pj, X = $(j/*+3) 


p 

0-50 

0-25 

0-10 

0-05 

0-025 

0-01 

0-005 

y 

0-0000 

0-6746 

1-2816 

1-6449 

1-9600 

2-3263 

2*5758 

X 

0-5000 

0-5758 

0-7737 

0-9509 

1-1402 

1-4020 

1-6058 

A-4 

0-3333 

0-4092 

0-6071 

0-7843 

0-9736 

1-2363 

1-4392 


A more convenient approximation of sufficient accuracy has recently been 
given by Cochran(i), who extended a method suggested by Fisher (i), It is 
essentially an approximation (by the normal distribution) to Fisher’s z-trans- 
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formation of a; and it involves values of the normal deviate y at the appropriate 
percentage levels P. These normal deviates y together with a function of y 
denoted by A are tabulated above for the levels P with which we are concerned 
here. 

To find an approximation to x for any pair of arguments 2 p, 2 q in the above 
range, calculate in turn the quantities 


2p + 2q 

y (A-fr)(A-2p) 
-J(A - A) pA 


x = 2 P 
* 2p +2 qe 2 *' 

As examples we consider two tabular values of a:.in order to obtain some idea of 
the accuracy of the approximation. 

Example 6. P = 0*01, 2p = 120, 2 q - 40, 


8x60x20 2*3263 1*236(-60) 

160 ’ Z ~V 58 ' S98+ 60x60 

This agrees with the exact value to four decimals. 


0*2833, x = 0-6299. 


Example 7. P = 0*50, 2 p - 30, 2 q — 120, 


A = 


8 x 15x60 
150 


= 48, 


0*333x18 

15x48 


0*00833, x = 0*1974. 


This differs from the exact value by about a unit in the fourth decimal. 
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Percentage points of the incomplete beta-function 


Beta Distribution: 50 per cent Points nor x 


v 1 =2q v^~2p 


V 

l %\ 

1 

2 ’ 

3 

4 

5 

6 

7 

8 

9 

1 

0-50000 

0-26000 

0*16319 

O '12061 

0-095526 

0-079033 

0 067378 

0-058711 

0-052016 

2 

•75000 

•50000 

•37004 ■ 

•29289 

•24214 

•20630 

•17966 

■15910 

•14276 

3 

•83681 

•02996 

•50000 

■41363 

■35245 

•30695 

•27181 

•24386 

•22112 

4 

•87939 

•70711 

•58037 

■50000 

•43556 

•38573 

■34609 

•31381 

•28703 

5 

0-90447 

0*76786 

0-64755 

0-56444 

0-50000 

0-44867 

0-40684 

0-37213 

0-34286 

6 

■92097 

•79370 

•09305 

■61427 

•55133 

•50000 

•45737 

•42141 

■39008 

7 

•93262 

•82034 

•72819 

•65391 

•59316 

•54263 

•50000 

•40355 

•43205 

8 

■94129 

•84090 

•76614 

•08619 

•62787 

•57859 

•53645 

•50000 

•40818 

9 

•94799 

■85724 

•77888 

•71297 . 

•65714 

•60932 

•50795 

•53182 

•60000 

10 

0-95331 

0-87056 

0-79775 

0-73555 

0-68214 

0-63588 

0-59546 

0-55984 

0-52824 

11 

•95765 

•88169 

•81366 

•75484 

•70376 

•65907 

■61968 

•58471 

■55346 

12 

•96125 

•89090 

■82725 

•77151 

•72262 

•67948 

•64116 

•60602 

•57613 

13 

•96429 

•89885 

•83899 

•78600 

•73923 

•69759 

•66036 

•62687 

•59661 

14 

•96689 

•90572 

•84924 

•79887 

•75396 

•71376 

•67760 

•64490 

•61520 

15 

0-96913 

0-91172 

0-86827 

0-81033 

0-76712 

0-72830 

0-69318 

0-66127 

0-63216 

16 

•97109 

•91700 

•86627 

•82038 

•77894 

•74143 

•70732 

•67020 

•64768 

17 

•97282 

•92169 

•87342 

•82950 

•78963 

•76334 

-72022 

•68980 

•06195 

18 

•97436 

•92587 

•87985 

•83774 

•79932 

•76421 

•73203 

•70242 

•67511 

19 

•97572 

•92964 

•88565 

•84522 

•80817 

•77417 

•74288 

•71401 

•68728 

20 

0-97696 

0-93303 

0-89092 

0-86204 

0-81626 

0-78331 

0-75289 

0-72472 

0-69858 

21 

•97806 

•93612 

•89573 

■86828 

■82370 

•79175 

•76215 

•73467 

•70909 

22 

•97907 

■93893 

•90013 

•80402 

■83057 

•79965 

•77074 

■74392 

•71889 

23 

■97999 

•94151 

•90417 

•86931 

•83692 

•80679 

•77873 

•75254 

•72805 

24 

•98083 

■94387 

■90790 

•87421 

•84281 

•81363 

•78018 

•76061 

■73663 

25 

0-98101 

0-94606 

0-91136 

0-87876 

0-84828 

0-81981 

0-79315 

0-76816 

0-74469 

26 

•98232 

•94808 

•91465 

•88298 

■85340 

•82568 

•79968 

•77526 

•75227 

27 

■98298 

•94095 

•91753 

■88692 

•85817 

•83118 

•80581 

•78193 

•76941 

28 

. -98360 

■98170 

•92031 

■89060 

•86206 

•83635 

•81157 

•78821 

•70616 

29 

•98417 

•96332 

•92290 

•89400 

•86685 

•84120 

•81701 

•79415 

•77253 

so 

0-98470 

0-95484 

0-92534 

0-89730 

0-87080 

0-84678 

0-82214 

0-79976 

0-77856 

40 

•98855 

•96594 

•94324 

■92136 

•90038 

■88030 

•80107 

•84266 

•82501 

60 

•99238 

•97716 

•96104 

■94046 

•93166 

•91731 

•90338 

■88985 

•87672 

120 

•99620 

•98851 

•98056 

•97204 

•90482 

•95710 

•94951 

•94202 

■93466 

CO 

1-00000 

1-00000 

1-00000 

1-00000 

1'00000 

1 

1-00000 

1-00000 

1*00000 

' 

1-00000 


This table gives the values of * for which l x [p t g)=0'50 where jjs g=|v x . 
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Beta Distribution': 50 per gent Points for % 


v x -2q v a = 2 p 


F 

10 

12 

16 

. 20 

24 

30 

40 

60 

120 

i 

O ' O 40687 

0-038746 

0-030867 

0'023052 

0-019168 

0-015301 

0-011460 

0-0076166 

0-0037997 

2 

■12948 

•10910 

•088278 

•066967 

•066126 

■046158 

•034064 

•022840 

•011486 

3 

•20225 

•17276 

■14173 

•10908 

•092099 

•074664 

•056756 

■038355 

■019443 

4 

■26446 

•22849 

•18977 

•14796 

•12579 

•10270 

•078644 

■053562 

•027361 

5 

0-31786 

0-27738 

0-23288 

0 18374 

015719 

0-12920 

0’099622 

0-068335 

0-036184 

6 

•36412 


•27170 


•18647 

•15422 

•11970 

•082690 

•042896 

7 

■ 

•36884 


•24711 

•21382 

-17786 

•13893 

•096624 

•050494 

8 

•44016 

•39308 

•33873 

•27628 

■23939 

•20024 

■15734 

•11015 

•067977 


•47176 

•42387 

•36784 


•26337 

•22144 

•17499 

•12328 

•065345 



0-46169 

0-39461 

0-32675 

0-28689 

0-24154 

0-19192 

0-13603 

0 072602 

11 


•47696 


•34846 

•30707 

•26064 

•20818 

•14842 

•079747 

12 

•54831 


■44162 

•36967 

•32704 

■27880 

•22379 

•16046 

•086786 

13 


•62110 

•46264 

■38956 

•34689 

•29610 

•23880 

•17217 

•093716 

14 

•58811 


•48194 


•36371 

•31268 

■25326 

•18366 

•10054 

15 


0-66838 



0-38059 

0-32832 

0-26715 

0-19463 

6-10727 

16 

•62147 

•57492 

•61684 

•44234 

•39660 

•34335 

•28055 

•20641 

•11390 

. 17 





•41181 

•35772 

•29347 

•21691 

•12043 

18 

•64984 



•47274 

•42626 

•37147 

•30593 

•22613 

■12686 

19 

>06248 

■61788 

•66118 

•48673 

•44002 

•38466 

•31706 

•23609 

•13320 

m 



0-57421 


0-46314 

0-39729 

0-32968 

0'24680 

0-13946 

21 

•68622 




•46566 

•40942 

•34082 

•26627 

•14661 

,22 

•69648 


•59807 

•62468 

•47762 

•42108 

•36168 

•26450 

•15168 

23 




•63599 

•48906 

•43228 

•36219 

■27350 

•15767 

24 

•71411 

•67296 

•61941 

•64686 

■60000 

•44305 

•37237 

•28229 

■16357 

26 

0‘72260 

0'68213 

0-62924 


0-51049 

0-45343 

0-38222 

0-29086 

0-18939 

20 


•69079 

EB 

•66714 

■52064 

•46342 

•39177 

■29924 

■17513 

27 

•73816 

H ’ 

•04747 

•57662 

■63019 

•47306 

•40103 

•30741 

•18079 

28 

•74629 


•66693 


•53946 

•48236 

•41001 

•31640 

•18638 

29 


•71417 

•06399 

• 5943.7 

•54837 

•49133 

•41873 

•32321 

•19189 

30 

0-76846 



060271 

065696 

0-60000 

0-42720 

0-33084 

0-19732 

El 


•77621 

•73283 


•62763 

•57280 

•60000 

•39866 

-24791 

60 

■86397 

•83964 

•80537 


•71771 

•66916 

•60134 

•60000 

■33209 

120 


•91321 

■89273 


•83643 

•80268 

•75209 

•66791 

■50000 

00 

1-00000 

1-00000 



1 00000 

1-00000 

I 00000 

1'00000 

14)0000 


For. y,= oo, z-0 
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170 Percentage points of the incomplete beta-function 


Beta Distribution: 25 per cent Points for x 


>’1=2 q V,= 2 p 


V 

V) \ 

1 

2 

3 

4 

6 

6 

7 

8 

9 

1 

0-14646 

0-062600 

0-039003 

0-028309 

0-022173 

0-018216 

0-016463 

0-013416 

0-011853 

2 

•43750 

•25000 

•17452 

•13397 

•10870 

•091440 

•078908 

•009395. 

•061929 

3 

■59716 

•39685 

•29801 

•23885 

•19937 

■17113 

■14991 

•13339 

•12015 

4 

•68878 

•50000 

•39448 

•32635 

•27852 

•24302 

•21560 

•19376 

•17596 

5 

O'74711 

0-57435 

0-46936 

0-39775 

0-34546 

0-30660 

0-27390 

0-24828 

0-22707 

6 

•78726 

•62996 

•52848 

•45632 

•40198 

•35944 

•32516 

•29692 

•27323 

7 

•81050 

•67296 

•57009 

•50494 

•45001 

•40614 

•37021 

•34022 

•31478 

8 

•83872 

•70711 

•61516 

•54582 

■49117 

•44680 

■40996 

•37885 

•35219 

9 

■85610 

•73487 

■04773 

•58060 

•52678 

•48245 

•44521 

•41343 

•38597 

10 

0-87021 

0-75786 

0-67529 

0-01062 

0-55783 

0-51390 

0-47062 

0-44451 

0-41655 

11 

•88177 

•77720 

•09888 

•03051 

•68513 

•64184 

•50476 

•47257 

•44435 

12 

•89144 

•79370 

•71931 

•06929 

•00930 

•66679 

•53009 

•49801 

•46970 

13 

•89900 

•80793 

•73710 

•07941 

•03085 

•68921 

•55300 

•52116 

•49289 

14 

■90672 

•82034 

•75288 

•09730 

•06017 

•60946 

■57382 

•64230 

■ 

•51419 

15 

0-91285 

0-83124 

0-70084 

0-71332 

0-00758 

0-62782 

0-59282 

0-56169 

0-53380 

16 

•91823 

•84090 

•77932 

•72773 

•68336 

•64456 

•61021 

•57953 

•55192 

17 

•92298 

•84951 

•79053 

•74077 

■69772 

•65986 

•62619 

•69599 

•56870 

18 

•92721 

•85724 

•80066 

•75203 

•71084 

•67391 

•64093 

■01122 

•58428 

19 

•93100 

•86422 

•80986 

•76345 

•72287 

•68686 

•66450 

•62536 

•59879 

20 

0-93442 

0-87055 

0-81825 

0-77337 

0-73396 

0-69882 

0-66720 

0-63852 

0-61234 

21 

•93751 

•87632 

•82593 

•78250 

•74418 

•70991 

•07895 

•65079 

•62600 

22 

•94033 

•88169 

•83299 

•79092 

•75360 

•72021 ' 

•68991 

•66226 

•63688, 

23 

•94290 

•88644 

■83950 

•79871 

•76246 

•72881 

•70015 

•67300 

•64803 

24 

•94526 

•89090 

•84553 

•80595 

•77066 

•73878 

*70973 

•68309 

•65852 

25 

0-94744 

0-89503 

0-86112 

0-81268 

0-7.7831 

0-74717 

0-71873 

0-69258 

0-66840 

26 

•94944 

•89885 

•86632 

•81896 

•78547 

•75505 

*72719 

•70151 

•67774 

27 

•96130 

•90241 

•86116 

•82484 

•79218 

•76244 

•73515 

•70995 

•08667 

28 

•95303 

•90572 

•86570 

•83035 

•79848 

•76941 

*74267 

•71793 

•69492 

29 

•95464 

•90882 

•86994 

■83552 

•80442 

•77598 

•74977 

•72548 

•70286 

30 

0-96614 

0-91172 

0-87393 

0-84039 

0*81002 

0-78219 

0-76649 

0-73263 

0-71038 

40. 

•96706 

•93303 

•90351 

•87685 

•85230 

•82947 

•80809 

•78797 

•76890 

60 

•97801 

•95484 

•93434 

•91548 

•89782 

•88113 

•86525 

•85009 

•83557 

120 

•98899 

•97716 

•96648 

•95047 

•94692 

• -93774 

•92887 

•92025 

•91187 

oo 

l'OOOQO 

1-00000 

1-00000 

100000 

1-00000 

hOOOOO 

l'OOOOO 

l'OOOOO 

1-00000 


This table gives the values of a for which Ijp, ? )=0-25 where p=iv v q-\v v 
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Beta Distribution: 25 per cent Points for x 


1/!=% Vj=2 p 


V 
h \ 

10 

12 

15 

20 

24 

30 

40 

60 

120 

1 

0-010016 

0-0087814 

0-0069734 

0-0051914 

0-0043101 

0-0034353 

0-0025669 

00017049 

0-0 3 84926 

2 

•056913 

•046816 

•037631 

•028358 

•023689 

•018990 

•014281 

•0095436 

•0047832 

3 

-10930 

•092592 

•075324 

■057467 

■048307 

•038986 

•029500 

•019844 

•010012 

4 

•16116 

•13797 

•I1360 

•087610 

•074095 

•060174 

•045827 

•031031 

•015764 

5 

0-20922 

0-18082 

0-15026 

0-11726 

0-099749 

0-081498 

0-062458 

0'042571 

0-021774 

6 

•25307 

•22068 

•18600 

•14585 

•12475 

•10251- 

•079043 

•054222 

•027923 

7 • 

■29291 

•25724 

■21758 

•17316 

■14888 

•12301 

•095405 

•065857 

•034143 

8 

■32908 

•29099 

•24802 

•19913 

•17203 

•14289 

•11145 

•077403 

■040396 

9 

•36198 

•32206 

•27644 

•22376 

■19420 

•16211 

•12712 

•088814 

•040056 

10 

039196 

0-35068 

0*30297 

0-24710 

0-21638 

0-18064 

0-14240 

O'10006 

0-052904 

11 

•41938 

•37712 

•32776 

•20921 

■23562 

■19850 

•15726 

•11113 

•059127 

12 

■44461 

•40168 

•35094 

•29017 

•25403 

•21670 

'171.71 

•12200 

■065317 

13 

•46702 

■42426 

•37265 

•31004 

•27338 

•23226 

•18575 

•13267 

•071466 

14 

•48893 

•44534 

•39302 

•32889 

•29100 

•24819 

•19938 

•14314 

•077570 

15 

0*60863 

0-46497 

0-41215 

0-34079 

0-30785 

0-26353 

0-21261 

0-15341 

0-083624 

16 

■62091 

■48330 

•43016 

•36380 

•32395 

•27831 

•22546 

•16347 

■089625 

17 

•54389 

•60043 

•44712 

•37998 

•33936 

•29254 

•23793 

•17332 

•096571 

18 

•56972 

•51649 

•46312 

•39539 

•35411 

•30625 

•25004 

•18298 

•10X46 

19 

•57449 

•63166 

•47825 

•41008 

•36824 

•31946 

•26179 

•19244 

•10729 

20 

0-58832 

0-54574 

0-49256 

0-42409 

0-38179 

0-33221 

0-27321 

0-20170 

011307 

21 

•60129 

•65909 

•60613 

•43740 

•39478 

■34450 

•28430 

•21078 

•11878 

22 

■61348 

•57169 

•61900 

•45025 

•40726 

■35037 

■29507 

•21966 

•12443 

23 

•62496 

•58360 

•53122 

•46247 

•41926 

•30783 

•30554 

•22837 

-13003 

24 

•63676 

•59487 

•54286 

•47418 

•43078 

•37891 

•31572 

•23690 

•13556 

25 

0-64607 

O'60556 

O'55392 

0-48539 

0-44186 

0-38961 

0-32601 

0-24525 

0*14103 

26 

•65663 

■61570 

•66447 

•49615 

•45263 

•39996 

•33524 

■25343 

•14645 

27 

•66478 

•62533 

•57455 

•50647 

•46281 

•40997 

•34460 

•26145 

•15180 

28 

•67346 

•63460 

•58417 

•51639 

•47272 

•41967 

•35371 

•26931 

•15710 

29 

■68170 

•64323 

•69337 

•52591 

•48228 

•42906 

•36259 

•27701 

-16234 

30 

0-68954 

0’65166 

0-60217 

0'58508 

0-49150 

0-43815 

0-37122 

0-28455 

0-16752 

40 

75095 

•71768 

•67308 

•61064 

■66853 

•51555 

•44650 

•35245 

•21626 

60 

■82163 

•79629 

•75915 

•70020 

•60914 

•62050 

•55390 

•46636 

•29913 

120 

•90370 

•88794 

■80554 

■■83103 

•80657 

•77041 

•71852 

•63381 

•46918 

co 

l'OOOOO 

l'OOOOO 

1-00000 

l'OOOOO 

1-00000 

1-00000 

l'OOOOO 

1-00000 

1-00000 


For yj = oo, #=0 
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Percentage points of the incomplete beta-functifyn 


Beta Distbibotton: 10 pee cent Points foe x 

vj =25 vt-ip 



1 

2 

3 

4 

5 

6 

7 

8 

1 

0*024472 

0-010000 

0-0001812 

0-0044577 

0-0034818 

0-0028653 


0-0020986 

2 

•19000 

•10000 

•067830 

•051317 

■041268 

•034511 


•025996 

3 

■35136 

•21544 

•15648 

•12310 

•10154 

•086434 


•060647 

4 

•46812 

•31623 

■24130 

•19580 

■16493 

•14256 

•12558 

•11224 

6 

0-55185 

0-39811 

0-31529 

0-26204 

0-22457 

0-19664 



0 

•61375 

•46416 

•37816 

■32046 

•27858 

•24664 



7 

•66104 

•51795 

•43151 

-37151 

•32685 

•29210 

•26421 

•24127 

8 

•69821 

•50234 

•47700 

•41611 

■30982 

•33319 



9 

•72814 

•69948 

•61610 

•45522 

•40811 

•37029 

•33915 

•31299 

10 

0-7S273 

0-03096 

0-64996 

0-48968 

0-44232 

0-40382 

0-37178 


a 

•77328 

•65793 

•57954 

•52022. 

•47300 

•43419 


•37374 

12 

•79069 

•68129 

•60555 

•54744 

•50002 

•46178 

•42889 


13 

■80504 

•70170 

•62860 

•57181 

•52560 

•48693 



14 

•81861 

•71969 

■64915 

■69375 

•54827 

•50992 

•47697 

•44827 

15 

0-82996 

0-73564 

0-66758 

0-61300 

0-58893 

0-63100 



16 

•83998 

•74989 

•68419 

•63164 

•58783 

•55040 

•61787 


17 

•84889 

•76270 

•69923 

•64809 

•60517 

•56829 



18 

•85686 

•77420 

■71293 

•66315 

•62114 

■58484 

Ess® 

•52473 

19 

•86403 

■78476 

-72644 

•67699 

•63588 

•60020 

•56876 


20 

0-87052 

0-79433 

0-73091 

0-68976 

0-04964 

0-61448 



21 

•87643 

•80309 

■74747 

•70156 

■86222 

•62779 

•59722 

•66980 

22 

•88181 

•81113 

■75722 

•71250 

•67403 

•64022 



23 

.•88675 

•81855 

•76026 

■72268 

■68504 

•65187 


•59547 

24 

•89129 

•82540 

■77404 

•73216 

•69535 

•66279 


•60721 

26 

0-89649 

0-83176 

0-78245 

0-74103 

0-70500 

0-67305 

0-64434 


26 

•89937 

•83768 

•78973 

•74933 

• -71407 

•68271 ■ 

•65446 

•02878 

27 

•90297 

•84319 

•79655 

•75711 

•72260 

•69183 


•63871 

28 

■90633 

•84834 

•80294 

■76443 

•73064 

•70044 


•64813 

29 

•90946 

•85317 

•80894 

■77132 

•73823 

■70868 

•68168 


30 

0-91239 

0-85770 

0-81459 

0-77783 

0-74541 

0-71630 

0-68984 

0-66659 

40 

■93381 

■89125 

•85693 

•82706 

•80025 

•77678 

•76319 

•73219 

00 

•95565 

•92612 

•90182 

•88023 

•86048 

•84212 

■82490 


120 

■97701 

■98236 

•94944 

•93773 

•92679 

•91643 

niiiiiifiiii 


00 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 



This table gives the values of * for which I, (p, 5 } = 0-10 where p=b’v gsk. 
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Beta Distribution: 10 per cent Points por x 


h-%t v i=%P 


s 

B 




24 

30 

40 

60 

120 

1 

0'0016S86 

0-0013709 

0-0010878 

0-0 a 80919. 

0-0*67167 

0-0 3 53506 

0'0 9 39965 

0-0 s 26636 

0-0*13213 

2 

■020852 

•017407 

•013950 

•010481 

■0087416 

■0069994 

•0052542 

•0035059 

•0017546 

3 

•054245 

■045740 

•037035 

•028119 

•023579 

•618982 

■014327 

•0096132 

•0048379 

4 

•092596 

■078823 

•064482 

•049452 

•041691 

■033749 

•025617 

•017288 

•0087521 

e 

013167 

0-11307 

0-093336 

0-072324 

0-061295 

0-049889 

0-038083 

0-025851 

0-013167 

0 

•16964 

•14685 

•12228 

■095853 

•081477 

•066668 

•051174 

•034941 

•017906 

7 

•20573 

•17941 

•15069 

•11886 

•10173 

•083668 

•064573 

•044345 

•022866 

8 

•23966 

•21040 

■17792 

•14161 

•12177 

•10064 

•078083 

•063928 

■027978 

9 

•27139 

•23970 

•20411 

•16374 

•14141 

•11743 

■091577 

•063600 

•033196 

10 

0-30097 

0-26732 

0-22908 

0-18513 

0-16056 

043394 

040497 

0-073298 

0-038489 

11 

■32863 

•29330 

•25284 

■20676 

•17915 

45010 

41820 

•08£977 

•043832 

12 

•35422 

•31772 

•27540 

•22659 

•19716 

•16687 

43123 

•092604 

■049206 

13 

•37817 

•34068 

•29682 

•24464 

•21457 

•18124 

44403 

•10215 

•054597 

14 

•40053 

•36228 

■31715 

•26292 

■23139 

49619 

•15659 

41161 

•059993 

15 

0-42143 

0-38261 

0-33645 

0-28046 

0*24762 

0-21072 

046889 

04 2096 

0-065386 

16 

•44100 

•40176 

•35478 

•29726 

•26327 

•22483 

48093 

43019 

•070768 

17 

•45934 

•41983 

■37219 

•31338 

•27837 

•23853 

49270 

43930 

■076134 

18 

•47057 

•43689 

■38875 

•32886 

•29293 

•25182 

•20420 

44828 

•081478 

19 

•49277 

■45302 

•40461 

•34369 

•30697 

•26471 . 

•21544 

• 45712 

•086796 

20 

0-50803 

0-46829 

0-41952 

0-35793 

0-32051 

0-27721 

0-22642 

046583 

0-092085 

21 

•52243 

•48276 

■43382 

•37161 

•33358 

•28934 

•23713 

•17440 

•097342 

22 

•53603 

•49649 

■44746 

•38475 

•34619 

•30111 

•24759 

48283 

40257 

23 

•54889 

•50953 

•46049 

•39738 

•35836 

•31253 

■25781 

■19112 

•10775 

24 

•50108 

•62193 

■47294 

•40954 

•37012 

•32361 

•26778 

49928 

41290 

25 

0-57263 

0-53373 

0-48485 

0-42123 

0-38147 

0-33437 

0-27761 

0-20730 

041801 

26 

•58361 

•64498 

•49624 

•43248 

•39246 

•34481 

•28701 

•21518 

•12308 

27 

•59406 

•55571 

■50716 

•44333 

■40306 

•35495 

•29629 

•22293 

42811 

28 

•60398 

•56595 

•51763 

■45378 

■41332 

•36479 

•30534 

•23054 

•13310 

29 

•61344 

•57574 

•52767 

■46386 

•42326 

•37436 

•31419 

•23803 

43804 

30 

0-62247 

0-58511 

0*53731 

0-47369 

043286 

0-38366 

0-32283 

0-24539 

044295 

40 

■69412 

•66034 

•61599 

•55476 

•51428 

46386 

•39910 

•31243 

•18960 

60 

■77851 

•75104 

•71386 

•66029 

■62333 

•57545 

•51067 

■41750 

•27063 

120 

•87897 

•86198 

•83814 

■80192 

•77563 

•73946 

■68688 

•60235 

•44158 

CO 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 


For vj=: GO, a=0 
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Beta Distribution : 5 pib cent Points nob x 


t > t ~ 2q vz - 2p 


V 

1 

2 

3 

4 

-T 

5 

6 

7 

8 

9 

1 

0-0061558 

0-0025000 

00015429 

0-0011119 

0-0 3 86820 

0*0 3 71179 

0-0=60300 

0*0 3 52300 

0-0=46170 

2 

•097500 

•060000 

■033617 

•025321 

•020308 

•016952 

•014548 

•012741 

■011334 

3 

•22852 

•13572 

•097308 

■076010 

•062412 

•062962 

•046007 

•040671 

•036447 

4 

■34163 

•22361 

•16825 

•13535 

•11338 

•097611 

•085728 

•076440 

•068979 

5 

043074 

0-30171 

0-23553 

0-19403 

0-16528 

0-14408 

0-12778 

0-11482 

0-10427 

6 

•50053 

•36840 

•29599 

•24860 

•21477 

•18926 

•16927 

•15310 

•13989 

7 

•55593 

•42489 

•34929 

■29811 

•26063 

•23182 

•20890 

•19019 

■17461 

8 

•60071 

■47287 

•39607 

•34259 

■30260 

■27134 

•24613 

•22532 

•20783 

9 

■63751 

•51390 

•43716 

•38245 

•34080 

•30777 

•28082 

•25835 

■23930 

10 

0-06824 

0-64928 

0-47338 

0-41820 

0-37563 

0-34126 

0-31301 

0*28924 

0-26894 

11 

•69425 

•58003 

•50546 

•45033 

•40712 

•37203 

•34283 

■31807 

•29677 

12 

•71654 

•60690 

•53402 

•47930 

■43690 

•40031 

•37044 

•34494 

•32286 

13 

•73583 

•63073 

•56958 

•50551 

■4-8219 

•42635 

•39604 

•37000 

■34732 

14 

•75268 

•65184 

•58258 

■52932 

•48626 

•45036 

•41980 

■39338 

•37025 

15 

0-78754 

0-67070 

0-60333 

0-65102 

0-50836 

0-47256 

0-44187 

0-41521 

0-39176 

16 

•78072 

•68766 

•62217 

•67086 

•62872 

•49310 

■46242 

•43563 

■41196 

17 

■79249 

•70297 

■63933 

•58907 

•54760 

•51217 

•48168 

■45474 

•43094 

18 

■80307 

•71687 

■65503 

•60684 

•56490 

•52991 

•49949 

•47267 

■44880 

19 

•81263 

•72954 

•86944 

•62131 

•58103 

•54645 

•51624 

•48951 

•46564 

20 

0-82131 

0-74113 

0-68271 

0-63584 

0-69605 

0-56189 

0-53194 

0-50535 

0-48152 

21 

•82923 

•75178 

•69496 

•04894 

•61004 

•67635 

•54669 

•52027 

•49052 

22 

•83047 

•76160 

•70632 

•60132 

•62312 

■58990 

•56050 

•53434 

•51071 

23 

•84313 

•77007 

•71687 

•67287 

•03636 

•60263 

•57303 

•54704 

•52415 

24 

•84927 * 

•77908 

•72669 

■08366 

•64684 

■61461 

•58596 

•56022 

•53689 

25 

0-85494 

0-78690 

0-73586 

0-09377 

0-05704 

0-62590 

0-69761 

0-57213 

0-54898 

26 

•86021 

•79418 

•74444 

•70327 

■06780. 

•03056 

•60864 

■58343 

•66048 

27 

•86511 

•80099 

•75249 

•71219 

•67738 

•64663 

•61909 

•59416 

•57141 

28 

■86967 

•80736 

•76004 

•72000 

•68643 

•66617 

■62900 

•60436 

•58183 

29 

■87394 

•81334 

•76715 

•72854 

■69499 

•60522 

•63842 

■61407 

•59177 

30 

0-87794 

0-81898 

0-77386 

0-73604 

0-70311 

0-67381 

0-64738 

0-62332 

0-60125 

40 

•90734 

•86089 

•82447 

•79327 

■76559 

•74063 

•71758 

•69636 

•67663 

60 

•93748 

•90497 

•87881 

•85591 

•83517 

•81006 

•79824 

•78150 

•76569 

120 

•96837 

•95130 

•93720 

•92458 

•91290 

•90192 

■89148 

•88150 

■87191 

OO 

1-00000 

1-00000 

1-00000 

1-00000 

1*00000 

J_ 

1-00000 

1-00000 

1-00000 

1 00000 


This table gives the values of x for which l x { p , 9)=0-OS where p =& s , q ~-, 
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Beta Distribution : 5 per cent Points por x 


v i =% v,= 2p 


V 

h\ 

10 



D 

B 


40 

60 

120 

1 




IfSB 

0-0*16727 

0 0 3 13326 

0 0 4 99536 

0’0 4 66082 

0-0*32904 

2 



■0068158 


•0042653 

■0034137 

•0025614 

•0017083 

•0 S 86452 

3 

n 


•022465 

ji 

•014264 

•011472 

•0086511 

■0057991 

•0029167 

4 

B B 


•043641 


■028053 

•022679 

•017191 

•011586 

•0058567 

5 



0-067312 


0-043994 

0-036747 

0-027240 

0-018458 

0-0093841 

6 

■12876 

•mil 

•002207 


•061103 

•049898 

•038224 

•026043 

•013317 

7 

■16142 


•11733 


•078783 

■064651 

•049781 

•034103 

■017640 

8 

•19290 

•16875 

•14216 

■11267 

•096658 

•079695 

•061676 

•042481 

•021976 

9 


•19618 

•16638 

•13288 

•11449 

■094827 

■073748 

•051068 

•026572 

10 

025137 

0-22244 

048984 


0-13211 

0-10991 

0-085885 

0-059785 

0-031288 

11 

•27823 

•24746 

•21244 

■17207 

•14943 

•12484 

■098008 

•068575 

•036094 

12 

•30364 

•27125 

•23413 


■16636 

•13955 

•11000 

•077394 

•040967 

13 

■32737 

•29383 

•26492 

•20908 

•18288 

■16401 

•12199 

•086209 

•045889 

14 

•34981 

•31524 

■27481 

•22669 

•19895 

•16818 , 

•13377 

■094994 

•050847 

16 

0-37095 

0-33554 

0-29382 

0-24370 

0-21457 

0-18203 

0-14539 

0 10373 

0-055827 

16 

•39086 

•35480 

•31199 

•26011 

■22972 

•19556 

•15682 

•11240 

•060821 

17 

•40966 

•37307' 

•32936 

•27594 

•24441 

•20877 

•16805 

•12099 

•065820 

18 

•42738 

•39041 

•34596 

•29120 

•26865 

•22164 

•17908 

•12950 

•070818 

19 

■44414 

•40689 

•30183 

•30591 

•27244 

•23418 

■18989 

•13791 

•076809 

20 

0-45999 

0-42256 

0-37701 

0-32009 

0-28680 

0-24639 

0-20050 

0-14622 

0-080789 

21 

•47601 

•43746 

•39154 

•33375 

•29874 

•25828 

•21088 

■15442 

■085753 

22 

■48926 

•45165 

•40544 


•31126 

•26985 

•22106 

•16252 

•090698 

23 

■50276 

■46518 

•41877 

■35964 

■32340 

•28112 

•23102 

•17051 

•095621 

24 

■61560 

•47808 

•43154 


•33515 

•29208 

•24078 

•17838 

•10052 

25 

0-52782 

0-49040 

0-44379 

0-38373 

0-34653 

0-30275 

0-25032 

0-18615 

0-10539 

26 

■53945 

•50217 

•45654 

•39516 

•35756 

•31314 

•26966 

•19379 

•11024 

27 

■55054 

•51343 

•40683 


■36826 

•32325 

■26880 

•20133 

•11506 

28 

•56112 

■52420 

•47768 

•41685 

•37862 

•33309 

•27775 

•20875 

•11983 

29 

•57122 

•53452 

•48812 

■42715 

■38867 

•34267 

■28650 

•21606 

•12458 

30 

0-68088 

0-54442 

0-49816 


0-39842 

0-35200 

0-29507 

0-22320 

0-12930 

40 

•65819 

•62460 

•58083 

•52099 

■48175 

•43321 

•37136 

•28936 

•17463 

60 

•75070 

•72282 

•68535 

•63185 

•59522 

•54807 

•48477 

•39458 

•26416 

120 

•86268 

•84504 

•82047 

•78342 

•7S661 

•72016 

•66738 

•58326 

•42519 

CO 

1-00000 

1-00000 

1-00000 


1-00000 

1'00000 

1-00000 

1-00000 

400000 


Eoi Vi= CO, * = D 
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Percentage ‘points of the incomplete beta-function 


Beta Distribution: 2-5 per cent Points roe % 

^=2 q V t = 2 p 




9-0 s 2l691 0-0 3 17 7 82 0-0 3 15064 0-0 3 13066 0-0*11533 
•010076 -0084038 -0072076 -0063096 -0066104 

■038748 *032820 -028471 -025143 -022613 

•078706 -067586 -059243 -062745 -047639 


■16606 -14663 


•24933 -22278 -20161 -18406 -16944 

-28642 -26774 -23460 -21523 -19897 


0-4781S 10-40856 10-35877 10-32071 0-29042 

•35234 -32086 

■38149 -34914 

•40838 -37545 

•43321 -39991 

0-61149 10-54628 10-49641 10-45618 0-42268 

•47746 -44390 

•49723 ' -46372 
•51561 -48224 

•53276 -49959 


0-54877 0-61586 

•86376 -53116 

•67780 -54563 

•69100 '55908 

•60341 -67187 


0-61611 0-58396 

•62615 -59340 

•67119 -63658 -60624 

•64646 -61652 

•65582 -62630 

0-66471 0-63559 

•73369 -70839 

•81156 -79193 

•89975 -88828 

1-00000 1-00000 1-00000 


0-26661 

•29482 

•32219 

•34779 

•37176 

0-39418 

•41520 

■43490 

■45341 

•47081 

0-48719 

•50263 

•51720 

•53098 

•54401 

0-55636 

•56808 

-67922 


0-60948 

•68532 

•77372 

•87743 

1-00000 


0-24486 

•27288 

•29930 

•32416 

•34755 

0-36966 

•39026 

■40976 

•42814 

•44549 

0-46187 

•47736 

•49202 

•50692 

■51911 

0-53163 

-54364 

-56488 

■56568 

•57599 

0-58582 

•66411 

•75668 

■86708 

1-00000 


, 0-22722 
•25409 
■27957 
•30368 
•32646 

10-34799 
•36833 
•38756 
•40575 
•42297 

0-43928 

•45475 

■46943 

•48338 

•49664 

0-50927 

•52130 

•53278 

•54373 

•55420 

0-56421 

•64448 

•74065 

■85717 

1-00000 


This table gives the values of * for which I„ ( p , q) = 0-025 where q=\v v 
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Beta Distribution ; 2-5 per cent Points for x 


Vi=2g v,=2 p 


1 

-... 

10 

12 

15 

20 

24 

30 

40 

60 

120 

1 

0-0 3 10323 

0'0 4 853 1 3 

0-0 1 67686 



0-0‘33285 

nil 


mui 

2 

•0050508 

•0042107 

•0033700 



•0016864 


•0 3 84 357 

•0*42187 

3 

•020382 

•017139 

•013838 



•0070519 

in 



4 

•043272 

•036693 

•029885 



•016514 


H| 


6 

0070233 

0-060028 

0-049302 

0-038002 


0-026068 

' 1 

1 


6 

•098988 

•085233 

•070563 

•054861 


•037985 



8 

7 

•12818 

■11113 

■092695 

Bn 

I 

•050772 

8 



8 

•15701 

•13700 

•11508 

liil 

plsflM 

•064092 

8 


8 

9 

•18604 

•10340 

•13732 


J 

•077712 

iSial 

EHil 

llJpa 

10 

0-21201 

0-18709 

0-15917 



0-091466 

0-071319 

0-049628 

0-026864 

11 

•23780 

•21091 

•18048 

•14665 

. 42623 

40523 

3 



12 

•26238 

•23379 

•20115 



41893 



1 11 

13 

•28573 

•25571 

•22112 

•18067 

•15770 

43249 



p^f 8 

14 

•30790 

•27667 

•24039 

•19753 

47299 

44588 

•11573 


l;!!' 1 

IS 

0-32893 

0-29668 

0-26893 

0-21392 

■ 


042669 

0090116 

0-048335 

. 16 

•34888 

•31578 

•27676 

■22983 


•17198 



9 

17 

•36779 

■33400 

■29389 


•21674 

48466 

•14823 


srSdIB 

18 

•38674 

•35138 

•31034 



49708 

45878 

•11444 

KM 9 

19 

•40278 

•36797 

•32614 

•27465 


•20922 

46916 

42244 

||1|9 

20 

0-41896 

0*38380 

034132 

0-28864 

0-25713 

1 

047938 

043038 

0-071749 

21 

•43435 

•39893 

•36589 

•30218 

•26985 

1 


43823 


22 

•44900 

•41338 

•36990 

•31628 

•28221 

■24402 



B 

23 

•46294 

•42720 

■38335 

•32795 




■ 

Bum 

24 

•47623 

■44042 

•39629 



■26587 




25 

0-48891 

0-45307 

040874 



0-27640 

0-22783 

046881 

0-095156, 

26 

•50101 

•46520 

•42071 


•32821 

•28667 


47622 


27 

•61257 

•47682 

43223 

■37406 

•33890 

•29669 


48364 

BCJmB 

28 



44334 


•34928 

•30047 

■25476 


B B 


•53421 

•49867 


•39584 

•36937 


•26339 

49789 

•11358 

30 

054435 

0-50895 

046434 

040594 

0-36918 

0-32632 

0-27186 

0-20492 

041812 

40 

•62616 

•59296 

•649.99 

49168 



•34780 


•16201 

60 

■72550 

•69743 

•65992 

•60674 


•52422 

•46239 

•37498 



•84764 

•82954 


■76678 


mmm 


•56668 




1-00000 

1-00000 

1-00000 


■ 

Hi 

1-00000 

1-00000 


For v x = x , xn =0 


































































178 Percentage points of the incomplete beta-function 


Beta Distribution : 1 per cent Points for % 


Vl =2q v s = 2p 


\ 

A 

1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

0-0 a 24672 

o-onoooo 

0-0 4 61686 

0-0 4 44446 

0-0 4 34699 

0'0 4 28446 

0 0*24097 

0-0*20897 

0-0*18449 

2 

•019900 

■oioooo 

■0066778 

•0050126 

•0040121 

•0033445 

•0028674 

•0025094 

•0022309 

3 

•080827 

•046416 

•032834 

■026458 

•020807 

•017599 

■015252 

•013458 

•012043 

4 

■15875 

•10000 

•073960 

•058903 

•049014 

•041999 

•036754 

•032682 

■029426 

5 

0-23520 

0-15849 

012142 

0-098877 

0-083563 

0-072429 

0-063948 

O'057264 

0 061857 

6 

•30387 

•21544 

•16979 

•14087 

•12065 

•10564 

•094014 

•084730 

•077136 

7 

•36370 

•26827 

■21636 

•18236 

•15801 

•13959 

•12511 

•11341 

•10375 

8 

•41540 

•31623 

■25997 

•22207 

■19437 

•17307 

•15612 

■14227 

•13073 

» 

■46009 

•35938 

•30024 

•25945 

•22910 

•20543 

•18637 

•17066 

■15745 

10 

0-49889 

0-39811 

0-33719 

0-29431 

0 20191 

0-23632 

0-21551 

0-19820 

O'18355 

11 

•53279 

•43288 

•37099 

•32667 

•29271 

•26560 

•24335 

•22469 

•20879 

12 

•56268 

•46416 

•40191 

■35664 

•32153 

•29323 

•26981 

•25003 

•23307 

13 

•58893 

•49239 

•43020 

•38437 

•34845 

•31924 

•29487 

•27417 

•25631 

14 

■61238 

•51795 

•45615 

•41006 

•37358 

•34369 

•31858 

•29712 

■27851 

15 

0-63336 

0-54117 

0-47999 

0-43387 

0-39706 

0-36066 

0-34098 

0-31891 

0-20968 

16 

■65224 

•56234 

•50194 

•45697 

•41899 

•38826 

•36214 

•33958 

•31986 

17 

•66930 

•58171 

•52219 

•47651 

•43951 

•40857 

•38213 

•35920 

•33905 

18 

•68479 

•59948 

•54094 

•49565 

•45872 

•42768 

•40103 

■37781 

•35733 

19 

■69892 

•61585 

•56832 

■51350 

•47674 

•44568 

■41890 

•39547 

•37474 

20 

0-71185 

063096 

0-57447 

0-53018 

0-49366 

0-46206 

0-43581 

041224 

0-39131 

21 

•72372 

•6449.5 

•58952 

■54581 

•50058 

■47868 

■45184 

42818 

40711 

22 

•73467 

•65793 

•60357 

•66048 

•52450 

•49383 

■46703 

44333 

•42217 

23 

•74479 

•67002 

•61071 

•57422 

•53869 

•50816 

•48144 

45775 

•43653 

24 

■75417 

■68129 

■62903 

•58717 

•55204 

•52174 

•49514 

47149 

•45026 

25 

0-76290 

0-69183 

0-64059 

0-59938 

0-56466 

0-63461 

0-50816 

048458 

0-46335 

26 

■77103 

•70170 

■65147 

•61090 

•57660 

•54683 

•52055 

49706 

47587 

27 

•77862 

•71097 

•66172 

•62180 

•58793 

■55845 

•53236 

•50899 

48786 

28 

•78573 

•71969 

■67139 

•63211 

•59868 

■56951 

■54362 

•52038 

49932 

29 

•79240 

•72790 

•68054 

•64188 

•60890 

•58004 

•55437 

•53127 

•51031 

30 

0-79867 

0-73564 

0-68919 

0-66116 

0-61862 

049008 

0-56464 

0-54170 

0-52085 

40 

•84541 

•79433 

•75561 

•72316 

•09482 

•66950 

■64656 

•62555 

•60617 

60 

•89449 

•85770 

•82898 

•80433 

•78233 

•76227 

•74376 

•72651 

•71034 

120 

•94599 

•92612 

•91014 

■89607 

•88321 

•87124 

•85995 

•84924 

•83900 

00 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

100000 

1-00000 

1-00000 

1-00000 


This table gives tho valuos of a; for which I x [p, g)=;0-01 where p-b'v <1=W 
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Beta Distribution : 1 per cent Points for x 


v i = 2 ? v t =2p 


V 

I'd \ 

10 

12 

15 

20 

24 

30 

40 

60 

120 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

Ifi 

10 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 
40 
60 

120 

oo 

0-0H6513 

•0020080 

■010898 

•026763 

0-047389 

•070804 

•095627 

•12095 

•14619 

0-17097 

•19506 

•21834 

■24073 

■26220 

0'28276 

■30240 

•32117 

•33910 

•35622 

037257 

•38818 

•40311 

•41738 

•43103 

0-44410 

•45601 

■46801 

•48011 

•49115 

0-50175 

•58819 

•69511 

•82918 

1-00000 

0’0 4 13647 

•0016737 

•0091569 

•022665 

0040434 

•060840 

•082714 

•10526 

■12796 

0-16044 

■17260 

■19398 

•21479 

•23489 

0-25426 

•27289 

•29079 

•30797 

•32440 

0-34029 

•35548 

•37006 

■38405 

■39749 

0-41040 

•42280 

•43473 

•44621 

•46726 

0-46789 

•55573 

■86701 

•81062 

100000 

0-0H0827 

•0013391 

•0073877 

•018436 

0-033149 
■050258 
•068820 
•088177 
•10787 . 

0-12760 

•14713 

■16633 

■18511 

•20338 

0-22113 

•23833 

•25497 

•27105 

•28658 

0-30157 

•31603 

•32999 

•34345 

•35645 

0-36899 

•38109 

•39278 

•40407 

•41497 

0-42552 

•61398 

•62969 

•78497 

1-00000 

0-0*8053I 

•0010046 

•0055887 

•014065 

0-025603 

•038982 

•053801 

■069456 

•085584 

0-10193 

•11830 

•13458 

•15065 

•16646 

0-18190 

•19711 

•21189 

•22630 

•24032 

0-25395 

•26721 

•28008 

•29258 

•30472 

0-31051 

•32795 

•33908 

•34985 

•36032 

0-37049 

•45778 

•57717 

■74677 

1-00000 

0-0*68831 

•0*83718 

•0046777 

•011824 

0-021534 

•033067 

•045816 

•059390 

•073472 

0-087838 

•10232 

■11681 

•13120 

•14544 

0-15948 

■17327 

■18681 

•20005 

•21301 

0'22567 

•23803 

•25008 

•26184 

■27329 

0-28446 

•29534 

•30694 

•31626 

•32632 

0-33612 

•42144 

•54167 

•71942 

1-00000 

0'0 5 53242 

■0 3 66980 

•0037588 

•0095436 

0-017459 

•026923 

•037481 

•048797 

•060623 

0-072776 

■085117 

■097542 

•10997 

•12235 

0-13462 

■14670 

•15873 

■17053 

•18212 

0-19351 

•20468 

•21563 

•22636 

•23687 

0-24716 

•26722 

•26707 

•27670 

•28012 

0-29534 

•37700 

•49647 

•68259 

1-00000 

0-0*39766 

•0*50239 

•0028317 

•0072226 

0-013275 

•020567 

•028767 

•037025 

•040956 

0-056621 

•066512 

•076547 

•086660 

•096802 

0-10693 

■11702 

■12704 

•13697 

•14680 

0-15651 

•16609 

•17554 

•18486 

■19403 

0-20305 

■21193 

•22066 

•22925 

•23768 

0-24597 

■32111 

■43655 

•62988 

I'OOOOO 

0-0*26400 

•0*33496 

•0018964 

•0048595 

0-0089747 

•013973 

•019640 

•025815 

•032376 

0-039229 

•046303 

•053541 

•060897 

•068334 

0-076824 

■083341 

•090806 

■098383 

•10588 

011334 

•12070 

•12812 

•13543 

•14268 

O'14986 
■15697 
•16401 
■17096 
•17785 

0-18465 

•24819 

•35258 

•54709 

TOOOOO 

0-0*13146 

•0*16749 

•0*95262 

•0024526 

0-0045520 

•0071235 

■010065 

•013300 

•016768 

0-020426 

•024237 

■028173 

•032212 

•036335 

0-040526 

■044772 

•049062 

•063386 

■067738 

0-002109 

•060404 

■070888 

•075285 

•079083 

0-084077 

•088466 

•092843 

•097210 

■10156 

0-10590 

■14811 

•22459 

•39479 

1-00000 


For ?!= co, a;= 0 



180 Percentage points of the incomplete beta-function 


Beta Distbibution: 0*5'mb cent Points fob x 
h~k v %-ty 


12 


13 

•53337 

14 

■55865 

15 

0-58144 

16 

-60200 

17 

■02080 

18 

•63789 

19 


20 


21 


22 

-69341 

23 

•70477 

24 

•71532 

25 

0-72516 

26 

•73434 

27 

•74294 

28 


29 

•75857 



120 63619 '91518 ■ 89893 -88442 -87120 -85892 - 84739 . -830 

oo 1-00000 1-00000 1-00000 1-00000 1-00000 1-00000 1-00000 1-000 

This table gives the values of x for which I„ ( p , 5 ) = 0-006 where p = iv it q=l 
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Beta Distribution : 0-5 per cent Points for x 


(*1=2? v i= 2j) 


\ 1 

V|\ 

10 

12 

16 

20 

24 

30 

40 

D 

120 

1 

0'0'41280 

0-0 6 34116 

0'0 8 27067 

0-0 5 20132 

0-0 5 I6707 

0-OM3310 

HII 


0-0*32862 

2 

•0010020 

-0 3 83507 

•O 3 06812 

■fl 3 50113 

•0=41762 

■0 3 33411 




3 

•0068204 

•0057290 

•0046206 


■0029242 

■0023493 



wmm 

4 

■018721 

■015844 

•012879 


•0082522 


ms 

n 

nyi 

5 

0 035415 

0-030191 

0-024729 

O'019006 

0-016040 



pH 


6 

•056299 

•047464 

•039162 

•030337 

•025709 

•020924 




7 

•077090 

•066693 

•065329 

•04*3189 

■036749 


KiMii'rfe 



8 

■099867 

•080787 

•072686 

•057076 

•048760 


•030828 

rmi 

sliffiffifll 

9 





■061433 

■ 



•013953 

m 

0-14606 

0 12831 

0 10862 

0086595 

0-074540 

0-061684 

0-047930 

0 033162 

0-017241 

n 

•16876 


•12683 

mSm 

■087903 




|jg|g||™| 

12 

•19092 

•16931 

■14489 

EMI 

•10139 

9 

•066252 

•046265 

jM 

13 

•21242 

•18919 

•10270 

iiiH 

•11490 

9 

•075662 

ESI 


14 

•23320 




■12836 



■a 

iSIMI 

15 



0-19726 


0-14170 


0-094697 

0-067019 

0-036743 

16 

•27248 

•24643 

•21388 

•17044 

•15488 

b 




17 





•16787 

•14241 

•11377 


•043741 

18 

■30872 

■27986 

•24576 


•18065 

•15373 

■12324 


■047815 

19 

•32674 

•29615 


•21829 

■19319 

•16489 

•13265 



20 

0-34206 

0-31184 



0-20549 


h 



21 


•32696 


•24458 

-21763 

•18673 

IH 


gEje||M 

22 

■37269 

•34151 

•30387 

•25723 

■22932 

■19738 

BtliSls 

•11676 

KIbSmm 

23 


•36552 

•31725 

•26964 

•24084 

E 

•16938 

E 

•068619 

24 

11111 

•36901 


•28153 

•25210 

•21811 

•17829 

KBl 

wmm 

25 


0-38200 

0-34272 

0-29320 

0-20309 



0-13772 

HJi 

26 

•42682 

■39452 

•35484 

EMM 

•27383 

H 

HI 

■14461 


27 




EMI 

■28432 

•24776 


•15143 

•085459 

28 


■41821 

•37792 

•32635 

•29455 

■25724 

•21266 

■15819 


29 


•42942 

•38891 

•33681 

•30454 

•26654 


•16489 


IB 



BUfl 


0-31429 

0-27565 

MH 



40 

iKmloil 

•53024 

HffiSjsjj 

•43493 

•39980 

•35700 

n 




•67384 

•64684 

HI 


•52194 

•47762 

•41913 

•33759 

■21421 


B Si 

Rpl||§ 

•77125 


■70531 

•66845 

mm 


Kiwi 

I| 





1-00000 

1-00000 

■ 

■ 

IHH 


For v x = oo, #=0 
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TABLE OF LAGRANGIAN COEFFICIENTS FOR HARMONIC 
INTERPOLATION IN CERTAIN TABLES OF 
PERCENTAGE POINTS 

Prepared by L. J. COMRIE and H. 0. HARTLEY 


Tliis table was prepared to facilitate interpolation in the tables of percentage points of the 
Incomplete Beta-Function in. the preceding paper. As, however, the main part of the table 
may be applied to interpolation in other tables with a similar lay-out, it is published 
separately. 

The table is based on harmonic interpolation—a device introduced by R. A. Fisher in 
his tables of percentage points of the distribution of z, whose two parameters (degrees of 
freedom) n x and n a range from 1 to oo. It consists in using values of n x and » 2 in harmonic 
progression, so that, with ljn l and l/n 2 as variables, z is tabulated at equidistant intervals 
near l/n L = 0 and also near l/n a = 0. This transformation renders the z-table (apart from 
its singularity at n x = co, % = oo) interpolate. As the percentage points of the Incomplete 
Beta-Function show a similar behaviour,* the values of v x and v t near the margin of the 
tables have been chosen in harmonic progression. Harmonic interpolation is, in fact, 
applicable to any table of percentage points (depending on a parameter n with an infinite 
range) in which the statistic can be adequately represented as a polynomial in 1/n.f It is 

Table of Lagrangian Coefficients 

Column headings are the arguments of tabular values and 
row headings the arguments of the interpolate. 





Ordinary 





7 

8 

9 

10 

12 

15 

20 



4 - 

- 

+ 

4 * 

- 

. + 

11 

0 009 231 

0-428 571 

1-090 909 

1-440 000 

0-300 000 

0-008 571 

0-000 140 


8 

9 

10 

12 

15 

20 

24 


— 

4 " 


+ 

4 * 


+ 

13 

0-171 875 

0-777 778 

1-100 000 

1-336 805 

0-162 963 

0-006 250 

0-000 679 

14 

0-223 214 

0-969 697 

1-285 714 

1-041 667 

0-507 936 

0-011 364 

0-000 992 


9 

10 

12 

15 

20 

24 

30 


— 

4 * 

- 

4 - 

4 * 

- 

+ 

16 

0-172 391 

0-448 000 

0-604 938 

1-238 914 

0-106 909 

0-017 284 

0-000 790 

17 

0-300 397 

0-780 000 

0-983 025 

1-258 272 

0-289 546 

0-040 124 

0-001 728 

18 

0-332 408 

0-833 143 

1-000 000 

1-024 000 

0-530 182 

0-067 143 

0-002 286 

19 

0-222 222 

0-550 000 

0-636 574 

0-670 370 

0-787 500 

0-050926 

0-001 862 




Harmonio 





10 

12 

15 

20 

24 

30 

40 



- 

+ 

A- 

- 

4 * 

-- 

16 

0-003 052 

0-039 551 

0-692 139 

0-769 043 

0-632 813 

0-247 192 

0-039 062 

17 

0-003 097 

0-037 459 

0-409 711 

1-213 958 

0-856 212 

0-315 162 

0-048 257 

18 

0-001 996 

0-022 993 

0-201 189 

1-341 259 

0-735 777 

0-251 486 

0-037 160 

19 

0-000 818 

0-009 091 

0-069 601 

1-237 345 

0-407 263 

0-126 647 

0-017 957 


* This issue, pp. 168-81. 

-j- It can be shown that any so-called “studentized” statistic has this property. 





21 

22 

23 


23 

26 

27 

28 
29 


31 ( 15 - 6 ) 

32 (16 0 ) 

33 ( 10 - 6 ) 

34 (17 0 ) 

35 ( 17 : 5 ) 
36 ( 18 - 0 ) 
37 ( 18 - 6 ) 
3 # ( 19 - 0 ) 
39 ( 19 - 6 ) 


41 ( 20 - 5 ) 
42 ( 21 - 0 ) 

43 ( 21 - 6 ) 

44 ( 22 - 0 ) 

46 ( 22 - 5 ) 

46 ( 23 - 0 ) 

47 ( 23 - 6 ) 

48 ( 24 - 0 ) 

49 ( 24 - 6 ) 

50 ( 26 - 0 ) 
61 ( 26 - 6 ) 
62 ( 26 - 0 ) 
53 ( 26 - 5 ) 
64 ( 27 - 0 ) 

66 ( 27 - 6 ) 
66 ( 28 - 0 ) 

67 ( 28 - 6 ) 

68 ( 29 - 0 ) 

69 ( 29 - 6 ) 


61 ( 30 - 6 ) 

62 ( 31 - 0 ) 

63 ( 31 - 6 ) 

64 ( 32 - 0 ) 

65 ( 32 - 6 ) 

66 ( 33 - 0 ) 

67 ( 33 - 6 ) 

68 ( 34 - 0 ).' 

69 ( 34 - 6 ) 


Table of Lagfangian Coefficients (continued) 


Harmonic 


12 

+ 

0-000 000 
0-000 579 
0-000 300 

16 

+ 

0-000 730 
0-000 993 
0-000 927 
0-000 670 
0-000 335 

20 

( 10 ) 

4 - 

0-003 664 
0-005 876 
0-006 875 
0-006 948 

0-006 358 
0-005 335 
0-004 062 
0-002 084 
0-001 305 


0-001 182 
0-002 210 
0-003 069 
0-003 764 

0-004 268 
0-004 619 
0-004 819 
0-004 883 
0-004 824 

0-004 669 
0-004 402 
0-004 068 
0-003 669 
0-003 220 

0-002 730 
0-002 210 
O-OOJ 669 
0-001116 
0-000 658 

+ 

0-000 652 
0-001 093 
0-001 619 
0-002 128 

0-002 616 
0-003 082 
0-003 624 
0-003 940 
0-004 329 


16 

0-010 497 
0-009 653 
0-004 908 

20 

0-040 868 
0-050 984 
0-044 488 
0-030 498 
0-014 607 

24 

( 12 ) 

0-041 469 
0-063 446 
0-071 503 
0-070 033 

0-002 423 
0-051 212 
0-038 250 
0-024 848 
0011 904 


+ 

0-010 511 
0-019 448 
0-026 748 
0-032 432 

0-036 680 
0-039 302 
0-040 733 
0-041 016 
0-040 293 

0-038 707 
0-036 392 
0-033 473 
0-030 065 
0-020 273 

0-022 191 
0-017 901 
0-013 477 
0-008 983 
0-004 475 


0-004 402 
0-008 695 
0-012 864 
0-016 863 

0 020 675 
0-024 304 
0-027 730 
0-030 945 
0-033 942 


20 

-t 

0-029 840 
0-337 838 
0-130 868 

24 

+ 

0-817 162 
0-611 813 
0-415 220 
0-243 980 
0-105 172 

30 

( 15 ) 

+ 

0-906 909 
0-793 076 
0-670 341 
0-547 129 

0-429 158 
0-320 073 
0-221 984 
0-136 887 
0-081 908 


0-060 764 
0-091 161 
0-122 104 
0-144 787 

0-160 037 
0-168 878 
0-172 219 
0-170 899 
0-166 679 

0-167 248 
0-146 218 
0-133 132 
0-118 464 
0-102 630 

0-085 989 
0-068 849 
0-051 474 
0-034 088 
0-016 878 

4 - 

0-016 417 
0-032 268 
0-047 471 
0-061 959 

0-075 683 
0-088 608 
0-100 709 
0-111 971 
0-122 388 


24 

+ 

0-537 463 
0-864 864 
1-006 068 

30 

4 " 

0-306 432 
0-673 575 
0-778 537 
0-914 926 
0-986 984 

40 

( 20 ) 

+ 

0-179 142 
0-352 478 
0-510 736 
0-848 460 

0-762 947 
0-853 629 
0-920 823 
0-966 308 
0-991 960 

+ 

0-992 724 
0-972 384 
0-941 117 
0-900 900 

0-863 629 
0-800 606 
0-743 549 
0-683 594 
0-621 808 

0-659 104 
0-496 256 
0-433 910 
0-372 604 
0-312 777 

0-254 781 
0-198 897 
0-146 330 
0-094 268 
0-045 797 


0-043 083 
0-083 441 
0-121 085 
0-156 045 

0-188 368 
0-218 113 
0-245 348 
0-270 161 
0-292 606 


30 

0-209 947 
0-263 378 
0-168 259 

40 

0-108 954 
0-174 804 
0-101640 
0162 663 
0-096 611 

60 

( 30 ) 

0-062 546 
0-113 297 
0-148 964 
0 - 168 348 

0-171 663 
0-160 037 
0136 121 
0-098 827 
0-063 141 

+ 

0-068 780 
0-121 648 
0-186 839 
0-253 378 

0-320 073 
0-386 006 
0-460 419 
0-612 696 
0-572 340 

0-628 992 
0-682 351 
0-732 223 
0-778 470 
0-821 038 

0-869 886 
D-SOF 035 
0-926 536 
0-964 463 
0-978 914 

1-017 846 
1-032 585 
1-044 367 
1-053 303 

1-069 570 
1-063 300 
1-064 636 
1-063 721 
1-060 092 


40 

4 - 

0-060 616 
0-088 640 
0-042 230 

60 

+ 

0-029 184 
0-044 986 
0-047 184 
0-038 122 
0-021 204 

120 

( 60 ) 

+ 

0-016 304 
0-028 839 
0-030 984 
0-040 717 

0-040 301 
0-036 580 
0-020 965 
0-021 212 
0-011 022 


0-011 310 
0-022 440 
0-033 000 
0-042 674 

0-051 212 
0-068 422 
0-064 160 
0-068 350 
0-070 939 

0-071 886 
0-071 202 
0-068 915 
0-065 067 
0-069 712 

0-052 016 
0-044 752 
0 036 297 
0-024 631 
0-012 838 

+ 

0-013 801 
0-028 485 
0-043 973 
0-060 189 

0-077 060 
0-094 516 
0-112 490 
0-130 919 
0-149 745 


60 

0-008 076 
0-008 890 
0-005 306 

120 

0-003 686 
0-006 670 
0-005 740 
0-004 546 
0-002 477 


00 

<C0) 


0-002 

015 

0-003 

525 

0-004 

469 

0-004 

863 

0-004 

768 

0-004 

268 

0-003 

453 

0-002 

416 

0-001 

240 

"h 

0-001 

241 

0-002 

431 

0-003 

520 

0-004 

505 

0-005 

335 

0-006 

006 

0-006 

BOO 

0006 

836 

0-000 

995 

0-000 

989 

0-006 

824 

0-006 

509 

0-000 

055 

0-005 

474 

0-004 

777 

0-003 

978 

0-003 

088 

0-002 

121 

0001 

088 

0-001 

131 

0-002 

295 

0-003 

481 

0-004 

681 

0-005 

886 

0-007 

089 

0-008 

281 

0-009 

455 

0-010 

607 


Table of Lagrang-ian Coefficients (continued) 


Harmonic 



20 

24 

30 

40 

60 

120 

oo 


( 10 ) 

4 * 

( 12 ) 

( 16 ) 

( 20 ) 

( 30 ) 

( 60 ) 

( co ) 

70 ( 35 - 0 ) 

0-004 692 

0-036 719 

0-131960 

0-312 795 

+ 

1-055 683 

+ 

0*168 909 

0-011 730 

71 ( 36 - 6 ) 

0-006 027 

0-039 276 

0-140 696 

0-330 812 . 

1-048 823 

0-188 360 

0-012 819 

72 (36 0 ) 

0-006 336 

0-041 610 

0-148 605 

0-346 746 

1-040238 

0-208 048 

0-013 870 

73 ( 36 ’ S ) 

0-006 616 

0-043 726 

0-155 705 

0-360 690 

1-030 048 

0-227 925 

0-014 878 

74 ( 37 > 0 ) 

0-006 867 

0-046 623 

0-162 013 

0-372 737 

1-018 370 

0-247 951 

0-016 841 

76 ( 37 - 6 ) 

0-006 093 

0-047 309 

0-107 562 

0-382 976 

1-006 312 

0-268 083 

0-016 755 

76 ( 38 ’ 0 ) 

0-006 292 

0-048 787 

0-172 345 

0-391 499 

0-990 981 

0-288 285 

0-017 617 

77 ( 38 - 6 ) 

0-006 465 

0-050 062 

0-176 416 

0-398 393 

0-976 477 ' 

0-308 523 

0-018 426 

78 ( 39 - 0 ) 

0 006 613 

0-051 141 

0-179 793 

0-403 745 

0-968 894 

0-328 764 

0019 178 

79 ( 39 - 6 ) 

0-006 736 

0-062 030 

0-182 602 

0-407 639 

0-941 324 

0-348 979 

0-019 872 

80 (40 0 ) 

0-006 830 

0-052 734 

0 - 184.570 

0-410 156 

O '-922 851 

0-369 141 

0020 508 

81 ( 40 - 6 ) 

0 - 006.912 

0-053 262 

0-186 027 

0-411 376 

0-903 557 

0-389 225 

0021 083 

82 ( 41 - 0 ) 

0-006 967 

0-053 621 

0-186 898 

0-411 373 

0-883 618 

0-409 208 

0-021 597 

83 ( 41 - 6 ) 

0-007 000 

0-053 816 

0-187 212 

0-410 223 

0-862 806 

0-429 071 

0-022 049 

84 (42 0 ) 

0 007 012 

0-063 866 

0-186 997 

0-407 993 

0-841 486 

0-448 793 

0-022 440 

86 ( 42 ’ 6 ) 

0-007 006 

0 053 746 

0-186 279 

0-404 753 

0-819 625 

0-468 367 

0-022 767 

86 ( 43 - 0 ) 

0-006 980 

0-053 495 

0-185 083 

0-400 567 

0-797 283 

0-487 749 

0-023 033 

87 ( 43 - 6 ) 

0-006 930 

0-053 110 

0-183 437 

0-395 496 

0-774 514 

0-606 964 

0-023 235 

88 ( 44 - 0 ) 

0-006 876 

0-062 696 

0 -lil 366 

0-389 600 

0-751 3 ll 

0-625 960 

0 023 376 

89 ( 44 - 5 ) 

0-006 798 

0-051 961 

0-178 892 

0-382 934 

0-727 905 

0-544 765 

0-023 455 

90 ( 46 - 0 ) 

0-006 700 

0 051 212 

0-176 040 

0-375 552 

0-704161 

0-563 329 

0-023 472 

91 ( 46 - 6 ) 

0-000 600 

0-060 354 

0-172 833 

0-307 506 

0-680 182 

0-581 673 

0-023 428 

92 (40 0 ) 

0-006 479 

0-049 393 

0-169 293 

0-368 843 

0-666 009 

0-599 780 

0-023 325 

93 ( 46 - 6 ) 

0-006 346 

0-048 337 

0-165 440 

0-349 609 

0-031 680 

0-617 042 

0-023 162 

94 ( 47 - 0 ) 

0-006 200 

0-047 190 

0-161 296 

0-339 848 

' 0-607 229 

0-636 264 

, 0-022 940 

96 ( 47 - 6 ) 

0 006 043 

0-045 959 

0-156 878 

0-329 602 

0-682 689 

0-652 011 

0-022 660 

96 ( 48 . 0 ) 

0-006 876 

0-044 647 

0-152 207 

0-318 909 

0-668 090 

0-669 708 

0 022 324 

97 ( 48 - 6 ) 

0-005 696 

0-043 262 

0-147 299 

0-307 806 

0-633 462 

0-686 542 

0 - 021 ' 931 

98 ( 49 - 0 ) 

0-006 609 

0-041 806 

0-142 173 

0-296 330 

0-608 829 

0-703 109 

0-021 484 

99 ( 49 - 6 ) 

0-006 312 

0-040 287 

0-136 844 

0-284 511 

0 484 217 

0*719 408 

0-020 983 

100 (60 0 ) 

0-006 107 

0-038 707 

0-131 328 

0-272 384 

0-469 648 

0-736 437 

0-020 429 

101 ( 60 - 6 ) 

0 004 894 

0-037 072 

0-125 640 

0-259 976 

0-435 143 

0-751194 

0-019 823 

102 ( 61 - 0 ) 

0-004 675 

0-035 385 

0-119 793 

0-247 316 

0-410 721 

0-766 679 

0-019 167 

103 ( 61 - 5 ) 

0-004 448 

0 033 661 

0-113 802 

0-234 429 

' 0-386 400 

0-781 891 

0-018 461 

104 ( 62 - 0 ) 

0-004 210 

0-031 873 

0-107 680 

0-221 342 

0-362 196 

0-796 831 

0-017 708 

106 ( 62 - 6 ) 

0-003 978 

0-030 056 

0-101 437 

0-208 077 

0-338 125 

0-811 499 

0-016 906 

106 (63 0 ) 

0-003 735 

0-028 201 

0-095 087 

0-194 656 

0-314 199 

0-826 896 

0-016 059 

107 ( 63 - 6 ) 

0-003 486 

0-026 314 

0-088 639 

0-181 099 

0-290 433 

0-840 022 

0-015 167 

108 ( 64 - 0 ) 

0-003 234 

0-024 307 

0-082 104 

0-167 427 

0-206 837 

0-853 880 

0-014 231 

109 ( 64 - 6 ) 

0-002 978 

0-022 462 

0-075 492 

0-153 658 

0-243 423 

0-867 470 

0-013 263 

110 ( 66 - 0 ) 

0-002 719 

0-020 484 

0-068 812 

0-139 809 

0-220 199 

0-880 796 

0-012 233 

111 ( 66 - 6 ) 

0-002 456 

0-018 494 

0-062 074 

0-125 896 

0-197 175 

0-893 868 

0-011173 

112 ( 68 - 0 ) 

0-002 190 

0-016 485 

0-055 284 

0-111 933 

0-174 368 

0-906 660 

0-010 074 

113 ( 66 - 6 ) 

0-001 922 

0-014 459 

0-048 452 

0-097 930 

0-161 755 

0-919 203 

0-008 937 

114 ( 67 - 0 ) 

0-001 661 

0-012 420 

0-041 584 

0-083 918 

0-129 374 

0-931 491 

0-007 762 

116 ( 67 - 6 ) 

0-001 379 

0-010 368 

0-034 688 

0-069 891 

0-107 219 

0-943 525 

0 006 552 

116 (58 0 ) 

0-001106 

0-008 307 

0-027 770 

0-055 866 

0-085 295 

0-955 309 

0-005 307 

177 ( 68 - 6 ) 

0-000 831 

0-006 238 

0-020 837 

0-041 854 

0-063 008 

0-906 845 

0-004 029 

118 ( 69 - 0 ) 

0-000 564 

0-004 162 

0-013 894 

0-027 867 

0-042 101 

0-978 137 

0-002 717 

119 ( 69 - 6 ) 

0-000 278 

0-002 083 

0-006 947 

0-013 913 

0 020 957 

0-989 188 

0-001 374 
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188 Lagrangian coefficients for harmomc interpolation 

particularly convenient if the high-order terms are so small that linear interpolation 
suffices, Unfortunately, however, if the interpolate is required to the same accuracy as the 
tabular values, linear interpolation in the above-mentioned tables is inadequate; hence this 
table has been prepared as the simplest method of preserving tabular accuracy in the 
interpolates. The interpolate is the sum of seven products of which the seven (Lagrangian) 
multipliers are taken from this table whilst the multiplicands are tabular entries in 
the table of percentage points. The examples below illustrate the use of the table. 

The calculation of the Lagrangian coefficients follows the standard formulae for inter¬ 
polation by a polynomial of the sixth degree. Where interpolation is harmonic the coeffi¬ 
cients are those for polynomials of the sixth degree in the reciprocal of the parameters 
used as argument. 

In the early part of the table, however, ordinary Lagrangian coefficients are given, 
since the polynomial in the parameter itself is preferable in this range. The part of the table 
with row headings less than 30 has been specifically designed to meet the requirements of 
the tables of percentage points of the Incomplete Beta Function. In particular it will be 
noted that there are two rows for 16, 17, 18 and 19, one giving harmonic and the other 
ordinary Lagrangian coefficients; the application of both rows affords a good check, as 
will be seen from Example 2 on p. 162 in the preceding paper. 

The table may be used not only for the progression 10, 12,16, 20, 24, 30, 40, 60, 120, oo, 
but also for submultiple progressions. The most important of these, namely 10, 12, 16, 20, 
30, 60, oo, is obtained hy halving the last seven terms, and is catered for by the auxiliary 
arguments in brackets. Division' by 4 yields the progression 6, 6, 7-6, 10, 16, 30, oo, while 
division by 6 yields 3, 4, 4-8, 6, 8,' 12, 24, oo, from which we can select the first seven or the 
last seven values. 

The missing values 7-6 or 4-8 can be found by ordinary interpolation from values in 
their immediate neighbourhood. If linear interpolation does not suffice we may use 

/(?.• 6) ~/(6) + 9/(7) + 9/(8) —/(9)}, 

which takes third differences into account, and 

/(4-8) = 0-12/(4) + 0-96/(5) — 0-0B/(6), 

which takes second differences into account. Since/(7-5) and/(4-8) are not required to full 
tabular accuracy these formulas will, as a rule, suffice. 


Example 1. Find the 0-6 % point of the Incomplete Beta Function corresponding to 
v 1 = 4 and i> 2 = 96. 

In the accompanying table enter row 96, which gives the Lagrangian multipliers. The 
corresponding multiplicands are taken from, the column of the table of 0-6 % points on 
*(96, 4) = +0-491 44 x 0-006 876 p, 180 and are the entries for v a = 20, 24, 30, 40, 60, 120 

and oo, which correspond to the column headings in the 
Lagrangian table. The sign of each product is also given 
at the top of the columns. We have, therefore, the soheme 
shown alongside. The result is two units greater in the 
fifth decimal than the exact value obtained by inverse 
interpolation in Pearson’s tables. 

Example 2. In Fisher’s table of 1 % points of the distribution of z find the point 
a(12, 64) = +0-7744 x 0-003 23 corresponding to n 1 = 12 and n 2 = 64. 

In Fisher’s table we have the harmonic progression 
10, 12, 16, 20, 30, 60, co, for whioh our table provides 
by means of the arguments in brackets. Using the 
Lagrangian multipliers, and the tabular entries corre¬ 
sponding to the bracketed column headings in row (64), 
we have the values alongside. 


-0-660 98 x 0-044 647 
+0-618 64x0-162 207 
-0-696 71 x 0-318 909 
+0-783 70 x0-668 090 
+ 0-884 42x0-669 708 
-1-000 00 x 0-022 324 
= 0-857 04 


-0-7122 x 0-024 40 
+0-6496 x 0-082 10 
-0-5864 x 0-167 43 
+ 0-6224x0-266 84 
+ 0-4574x0-863 88 
-0-3908x0-014 23 
= 0-4647 
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TABLE OE PERCENTAGE POINTS OP THE 
y a DISTRIBUTION 

Calculated by CATHERINE M. THOMPSON 
Editorial 

In the calculation of the percentage points of the incomplete S-function 
I x {p, q) for large values of p and q described in the preceding paper, use has been 
made of the relation between this function and the incomplete /’-function 
I(u, p*)f which was tabulated by Karl Pearson ( 2 ). Since the incomplete /'-func¬ 
tion is related to the probability integral of y a , it was decided that the calculation 
of the percentage points of u already carried out should be extended to form 
complete tables of percentage points of y 2 . Before describing the method of com¬ 
putation used in deriving these tables it is desirable to relate them to existing 
tables and to define the relation between the functions. 

In common terminology the probability distribution of y 2 having v degrees 
of freedom may be written 

fix*) = (1) 

The probability integral of y 2 or the chance that this quantity exceeds a 
given value y 2 is then 

P = W)= /(xW- (2) 

J x' 

Conversely, for given degrees of freedom v, the integral (2) will be equal to 
a given probability level P for one particular lower limit y 2 . This lower limit is 
called the percentage point y 2 corresponding to v and P ; it will be denoted by y 2 (P). 

These percentage points y 2 (P) were first tabulated by R. A. Fisher (i) for 
P = 0-99, 0-98, 0-95, 0-90, 0-80, 0-70, 0-60, 0-30, 0-20, 0-10, 0-05, 0-02, 0-01$ and 
for v = 1 (1)30. Most entries in the body of Fisher’s table are given to three 
decimal accuracy (i.e. four to five significant figures) but more decimals are given 
for small percentage points which are given to three-figure accuracy. In the table 
which follows yf(P) is tabulated for P = 0-995, 0-99, 0-975, 0-95, 0-90, 0-76, 0-50, 
0-25, 0-10, 0-05, 0-026, 0-01, 0-005 (which are the levels used for I x (p, q)) whilst 
the range of the degrees of freedom has been extended up to v = 100. The per¬ 
centage points are given to six significant figures, although for v > 50 the sixth 
figure may be in error by one or two units. 

-|- Karl Pearson’s notation was I{u,p). To avoid confusion with the parameter p in I x (p, l) we 
havs added the asterisk. 

3 In a later edition the level P — 0-001 was added. 

i3-a 
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Table of percentage points of the distribution 


Table of percentage points of the x 2 distribution 


H 

0-996 

0-990 

0-975 

0-950 

0-900 

0-750 

i 

392704.10- 10 

157088.10-» 

982069.10- 8 

393214.10- B 

0-0157908 

0*1015308 

2 

0-0100251 

0-0201007 

0-0506356 

0-102587 

0-210720 

0-575364 

3 

0-0717212 

0*114832 

0-215795 

0-361846 

0-584375 

1-212534 

4 

0-206990 

0-297110 

0-484410 

0-710721 

1-063023 

1-92255 

5 

0-411740 

0-654300 

0-831211 

1-145476 

1-61031 

2-67460 

6 

0-675727 

0-872085 

1-237347 

1-63539 

2-20413 

3-45400 

7 

0-989205 

1-239043 

1-68987 

2-16735 

2-83311 

4-26485 

8 

1-344419 

1-646482 

2-17973 

2-73264 

3-48054 

5-07064 

9 

1-734926 

2-087912 

2-70039 

3-32511 

4-16816 

6-89883 

10 

2-15585 

2-55821 

3-24697 

3-94030 

4-80518 

0-73720 

11 

2-60321 

3-06347 

3-81575 

4-67481 

5-57779 

7-58412 

12 

3-07382 

3-57056 

4-40370 

5-22603 

6-30380 

8-43842 

13 

3-56503 

4-10691 

5-00874 

5-89186 

7-04160 

9-29906 

14 

4-07468 

4-60043 

5-62872 

6-57063 

7-78853 

10-1653 

15 

4-60094 

5-22936 

6-26214 

7-26094 

8-54675 

11-0365 

16 

5-14224 

6-81221 

0-90706 

7-96164 

9-31223 

11-9122 

17 

5-69724 

6-40776 

7-56418 

8-07170 

10-0862 

12-7919 

18 

6-26481 

7-01491 

8-23075 

9-39040 

10-8649 

13-0763 

10 

0*84398 

7-63273 

8-90666 

10-1170 

11-6609 

14-5620 

20 

7-43386 

8-26040 

9-69083 

10-8508 

12-4426 

16-4518 

21 

8-03366 

8-89720 

10-28293 

11-5913 

13-2390 

16-3444 

22 

8-64272 

9-64249 

10-9823 

12-3380 

14-0415 

17-2390 

23 

9-26042 

10-19567 

11-6886 

13-0905 

14-8479 

18-1373 

24 

9-88623 

10-8564 

12-4011 

13-8484 

16-0587 

18-0372 

2 6 

10-5197 

11-5240 

13-1197 

14-6114 

16-4734 

19-8393 

26 

11-1603 

12-1081 

13-8439 

15-3791 

17-2019 

20-8434 

27 

11-8076 

12-8786 

14-5733 

16-1513 

18-1138 

21-7404 

28 

12-4613 

13-5648 

15-3079 

16-9279 

18-9392 

22-6572 

29 

13-1211 

14-2565 

10-0471 

17-7083 

19-7677 

23-6666 

30 

13-7867 

14-9535 

16-7908 

18-4926 

20-5992 

24-4776 

40 

20-7085 

22-1643 

24-4331 

26-6093 

29-0606 

33-6603 

50 

27-9907 

29-7067 

32-3674 

34-7642 

37-0880 

42-8421 

60 

35-5346 

37-4848 

40-4817 

43-1879 

40-4689 

62-2938 

70 

43-2752 

45-4418 

48-7576 

51-7393 

56-3290 

01-6083 

80 

51-1720 

63-5400 

57-1532 

60-3915 

64-2778 

71-1445 

90 

59-1963 

61-7541 

65-6466 

69-1260 

73-2912 

80-0247 

100 

67-3276 

70-0648 

74-2219 

77-9296 

82-3581 

90-1332 

Vr 

-2-5758 

-2-3263 

-1-0600 

-1-0440 

-1-2816 

-0-6745 


For 30 < v < 100 interpolation formulae (6) or (7) of the Introduction may he used. 
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Table of percentage points of the distribution ( continued ) 


K 

0-500 

0-250 

0-100 

0-050 

0-025 

0-010 

0-005 

1 

0-454937 

1-32330 

2-70554 

3-84146 

5-02389 

6-63490 

7-87944 

2 

1-38629 

2-77259 

4-60517 

5-99147 

7-37776 

9-21034 

10-5966 

3 

2-36597 

4-10836 

6-25139 

7-81473 

9-34840 

11-3449 

12-8381 

4 

3-35670 

5-38527 

7-77944 

9-48773 

11-1433 

13-2767 

14-8602 

5 

4-35146 

6-62568 

9-23635 

11-0705 

12-8325 

15-0863 

16-7496 

0 

5-34812 

7-84080 

10-6446 

12-5916 

14-4494 

16-8119 

18-5476 

7 

6-34581 

9-03715 

12-0170 

14-0671 

16-0128 

18-4753 

20-2777 

8 

7-34412 

10-2188 

13-3616 

15-5073 

17-5346 

20-0902 

21-9560 

9 

8-34283 

11-3887 

14-6837 

16-9190 

19-0228 

21-6660 

23-5893 

10 

9-34182 

12-5489 

16-9871 

18-3070 

20-4831 

23-2093 

25-1882 

11 

10-3410 

13-7007 

17-2760 

19-6751 

21-9200 

24-7250 

26-7569 

12 

11-3403 

14-8454 

18-5494 

21-0261 

23-3367 

26-2170 

28-2995 

13 

12-3398 

15-9839 

19-8119 

22-3621 

24-7356 

27-6883 

29-8194 

14 

13-3393 

17-1170 

21-0642 

23-6848 

26-1190 

29-1413 

31-3193 

15 

14-3389 

18-2451 

22-3072 

24-9958 

27-4884 

30-5779 

32-8013 

10 

16-3386 

19-3688 

23-5418 

26-2962 

28-8454 

31-9999 

34-2672 

17 

16-3381 

20-4887 

24-7690 

27-5871 

30-1910 

33-4087 

35-7185 

18 

17-3379 

21-6049 

26-9894 

28-8693 

31-5264 

34-8053 

37-1564 

19 

18-3376 

22-7178 

27-2036 

30-1435 

32-8523 

36-1908 

38-5822 

20 

19-3374 

23-8277 

28-4120 

31-4104 

34-1696 

37-5662 

39-9968 

21 

20-3372 

24-9348 

29-6151 

32-6705 

35-4780 

38-9321 

41-4010 

22 

21-3370 

26-0393 

30-8133 

33-9244 

36-7807 

40-2894 

42-7960 

23 

22-3309 

27-1413 

32-0069 

35-1725 

38-0757 

41-6384 

44-1813 

24 

23-3307 

28-2412 

33-1983 

36-4151 

39-3641 

42-9798 

45-6586 

25 

24-3366 

29-3389 

34-3816 

37-6525 

40-6465 

44-3141 

46-9278 

26 

25-3304 

30-4345 

35-5631 

38-8852 

41-9232 

46-6417 

48-2899 

27 

26-3363 

31-6284 

36-7412 

40-1133 

43-1944 

46-9630 

49-6449 

28 

27-3363 

32-6205 

37-9159 

41-3372 

44 - 46P7 

48-2782 

' 50-9933 

29 

28-3362 

33-7109 

39-0875 

42-5669 

45-7222 

49-5879 

62-3356 

30 

29-3360 

34-7998 

40-2560 

43-7729 

46-9792 

50-8922 

53-6720 

40 

39-3364 

46-6160 

51-8060 

55-7586 

59-3417 

63-6907 

66-7669 

60 

49-3349 

66-3336 

63-1071 

67-5048 

71-4202 

76-1539 

79-4900 

60 

59-3347 

66-9814 

74-3970 

79-0819 

83-2976 

88-3794 

91-9517 

70 

69-3344 

77-5766 

85-5271 

90-5312 

95-0231 

100-425 

104-215 

80 

79-3343 

88-1303 

96-5782 

101-879 

106-629 

112-329 

116-321 

90 

89-3342 

98-6499 

107-565 

113-145 

118-136 

124-116 

128-290 

100 

99-3341 

109-141 

118-498 

124-342 

129-561 

135-807 

140-169 

2/p 

0-0000 

+ 0-6745 

+ 1-2816 

+ 1-6449 

+ 1*9600 

+ 2-3263 

j + 2-6758 


Eor v > 100 taka Xv(P) =v|l — J^ + yp or X%( p ) = M2/p+-s/( 2 '' " 1 )) 2 ' 

according to the degree of acouracy required. 


13-3 



190 Table of percentage points of the y 2 distribution 

The relation with the incomplete I -function is given by 

W)- (3) 

where y 2 = 2u<J(p*-\- 1), (4) 

the degrees of freedom, v, of y 2 being given by 

v = 2p* + 2. (5) 

The above relations were used for the computation of y 2 (P). Most of tlie 
entries were obtained from the Tables of the Incomplete- T-Function (2). In these 
tables the column headed p* = lv — 1 was entered, the root a of 

J(«, p*) = 1 - P 

found by inverse interpolation and transformed into the corresponding percen¬ 
tage point for y 2 Tby substitution in equation (4). 

Although for small values of p* and a the table of I(u, p*) is not interpolable, 
formal inverse interpolation in this range of the table still yields approximate 
values of the percentage points. To make these accurate, auxiliary tables of 
P„(y 2 ) were construetedfor arguments y 2 in the neigh bourhood of the approximate 
percentage points, and the exact values of y 2 (P) found by inverse interpolation 
in these auxiliary tables. The latter were constructed from the expansion of 
P v (% 2 ) given on p. xxxi of Tables for Statisticians and Biometricians, Part 1(3). 
Since the auxiliary tables were required only for small values of y a a few terms in 
the expansions were sufficient to yield i^(y 2 ) to the required accuracy. 

Whilst the existing table of percentage points of y 2 (i) is confined to the range 
v =s 1 (1) 30, the range v ~ 30 (10) 100 has been added in the table below. This has 
been done because the customary approximation to y 2 (F) by the corresponding 
normal deviates is not veyy satisfactory in this range of v. There are, in fact, 
percentage points which differ from the approximate ones in the second significant 
figure. In our table, however, linear interpolation (whioh is particularly con¬ 
venient at interval 10) yields interpolates accurate to about four significant 
figures. If we write v •— Wa rn with 3 < k < 10 and 0 < m < 10 then w e have 

Xv = M 10 ~ m ) Xlou +^Xioft+io}- (0) 

Tor instance for v - 54 and P = 0-01 we have 

A(O-Ol) - &{6yto(O-Ol) + 4yf o (0.01)} = 81-04. 

If higher accuracy is required we have to use the four point Lagrangian formula, 
viv„ 

Xv = b-i X%k-u + Lq xiok -b Li Xwk+10 T -^2 Xiofc+ao (?) 

where the Lagrangian coefficients W L , L 0 , L x and L., are tabulated below. 
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m 

h_i 

4 

4 

4 



- 

4- 

+ 

— 


0 

0-0000 

1-0000 

0-0000 

0-0000 

10 

1 

0-0285 

0-9405 

0-1045 

0-0165 

9 

2 

0-0480 

0 - 86-10 

0-2100 

0-0320 

8 

3 

0-0595 

0-7735 

0-3315 

0-0455 

7 

4 

0-0040 

0-0720 

0-4480 

0-0560 

6 

5 

0-0025 

0-5625 

0-5025 

0-0025 

5 


- 

+ 

-L 

- 



4 

hi 

4 

4 X 

m 


Returning to the above example we obtain 

Xm(O-OI) = - 0-0640xlo(0-01) + 0-6720 / \i(0-01) + 0>4480; V | 0 (0-01) - 0-0560 A ? 0 (0-Ql) 
= 81-069 

which is the exact interpolate to five figures. 

For v > 100 we can make use of Fisher’s approximation to 1\.(X‘) by the normal 
probability integral and. calculate X[P) from the formula 


Xv(P) = l{yr+*J{2v- 1)} 2 ( 8 ) 

where the normal deviates y P corresponding to the thirteen percentage levels 
are given in the last line of the main table. For v - 100, this approximation has 
an accuracy of about 1 % in the worst cases, but as v increases it becomes more 
accurate. 

A more accurate approximation has been given by Wilson and Hilferty(-i); 
this assumes that (y 2 /^) 1 is normally distributed about 1 - 2/(9s) with standard 
deviation ^(2/9y). That is to say, the probability levels may be calculated from 

<9) 

Comparative numerical values showing the relative accuracy of these two 
formulae are given in the Note by Mrs M, Merrington on pp. 200-2 below. 
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MISCELLANEA 

(i) Theory of Probability. By Harold Jeotreys. Oxford University Press. 1939. 

7 + 380 pp. 21a. net. 

In the histoiy of the application of probability theory to the problem of drawing inferences 
from observations, no set of idoas has played a more controversial role than that asso¬ 
ciated with inverse probability. Variolas leaders of modem thought hi statistical inference 
have pointed out the logical difficulties inherent in the application of inverse probability. 
R. A. Fisher, through hia methods of maximum likelihood and of fiducial limits, has intro¬ 
duced principles of statistical inference which make the introduction of the notion of inverse 
probability irrelevant. These principles have been extended and refined by J. Neyman, 
E. S. Pearson, A. Wald and others, until now we have available in statistical literature a 
self-consistent discipline of statistical inference which is independent of inverse probability. 

In the present book the author proposes a system of statistical inference based on the 
principles of inverse probability, applying it to the same problems which have been treated 
by Pisher, Neyman, Pearson and others without using inverse probability. The attitude 
which the author takes towards probability theory is somewhat similar to that taken by 
J.M. Keynes. Probability is regarded as a subjective phenomenon. The essential idea is that 
probability is a matter of comparing ‘reasonable degrees of belief’ in propositions. In 
Chapter i the author goes through a considerable amount of psychological and philosophical 
discussion attempting to justify this approach. This discussion is finally formalized by a set 
of six axioms. The primitive or undefined notion is that of the relation ‘given p, q is more 
probable than r where p, q, and r are propositions. The symbol used for denoting the prob¬ 
ability of q given p is P{q | p). The six axioms which are used are as follows: 

(1) Given p, q is either more or less probable than r, or both are equally probable; and no 
two of these alternatives can be true. 

(2) If p, q, r, a are four propositions and given p, q is more probable than r and r is more 
probable than a, then given p, q is more probable than s. 

(3) All propositions deducible from a proposition p have the same probability on data p; 
and all propositions inconsistent with p have the same probability on data p. 

(4) If, given p, q and q' cannot both be true, and if, given p, r and r' cannot both be true, 
and if, given p, q and r are equally probable and q' and r' are equally probable, then given 
p, ‘ q or q' ’ and ‘ r ot r' ’ are equally probable. 

(5) The set of possible probabilities on given data, ordered in terms of the relation ‘more 
probable’, is not of higher ordinal type than the continuum including the end-points. 

(6) If pq entails r, then P[qr i p) = P(q | p). 

In axiom 0, the expression ‘a entails b' is defined as meaning ‘a is deducible from b\ or 
‘a is identical with &’, or ‘a is identical with some proposition asserted in 6’, The expression 
‘ah' is taken as the logical product, that is 1 both a and b ’. 

The introduction of numbers for expressing probabilities is made through three ‘con¬ 
ventions ’. The first convention associates the larger of two numbers with the more probable 
of . two propositions, the second one states that if given p, q and q 1 are mutually exolusive, 
thenPiglpJ + P^' |p) = P(q or q'\p), and the thirdstates that if p entails g, then P(q\p) = 1. 

In order to be thoroughly rigorous in his axiomatic approach, presumably the author 
should have postulated the existence of an aggregate of propositions on which to operate. 
The possibility of an infinite number of propositions should, of course, not be excluded, as 
will be seen when the author applies his theory to problems involving continuous random 
variables. In the case of an infinite number of propositions, it appears that the assumption 
should be made that axiom 4 would hold in case of two infinite sets of mutually exclusive 
alternatives. A similar assumption would have to be made for the second convention. The 
proponent of the measure theory approach to probability has essentially the same problem 
to deal with, but he handles it by assuming the existence of set functions which are completely 
additive over his postulated field of sets. 
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Proceeding from his six axioms and three conventions the author completes his chapter 
by establishing twelve theorems which are used as the basis for the work in the subsequent 
chapters. The question of an infinite number of alternatives is not covered by these theorems. 

Chapter n, on ‘Direct Probabilities’, is devoted to derivations and discussions of the 
binomial, normal, Poisson, Pearson, multinomial. Chi-square, t, a and other frequency laws 
and their properties. The characteristic function and illustrations of its use in the deter¬ 
mination of probability laws are presented. 

In building up his system of statistical inference Jeffreys proceeds in Chapters m-vin 
by applying the principle of inverse probability to the results of Chapter n. Thus if S is a 
set of observations subject to a given discrete distribution law P(S \ OH) derived under a 
distribution hypothesis H, where 6 is a parameter to he estimated, an a priori probability 
function g(6) dO for 0 is introduced. The posterior probability law of 0 given S, say P(d | SH), 
is given by 

I EP(S | 6H)g{6) dd 


where E denotes summation with respect to all possible configurations of values of iS, and 
the integral is taken with respect to 6. A similar analysis results when 8 is subject to a con¬ 
tinuous distribution law. The function P(6 1 SH.) is then taken as a basis for estimating 0. 
For two values of 6, say and 0, it 6 1 is called more probable than Qi if P[0 X \ SH) > P(0 2 ( SH). 
Needless to say, this comparison of posterior probabilities depends on the choice of g(6). 
Now, from the point of view of applying this discipline, g(d) would rarely, if ever, be known, 
and the controversy over inverse probability centres around the problem of choosing g(0). 
The author adopts two rules for selecting g{6): (1) If the parameter may have any value in 
a finite range, or from — co to + <x> its prior probability should be taken as uniformly dis¬ 
tributed. (2) If the parameter may conceivably have any value from 0 to + co, the prior 
probability of its logarithm should be taken as uniformly distributed. The adoption of these 
two rules appears to the reviewer to be extremely vulnerable. First of all, what does it mean, 
in general, for a parameter to be uniformly distributed on the interval — oo to + co ? This 
question appears to be particularly in order since the author is performing the formal calculus 
of probabilities in exactly the same manner in which measure proponents calculate prob¬ 
abilities. They continually use the property that a finite total probability (taken arbitrarily 
as unity) is associated with each probability function. Presumably, meaning could be 
injected by carrying out the work for a finite interval — K to K and then taking the limit of 
the results or answers as If co. Similarly, it may be asked what it means to have the loga¬ 
rithm of a parameter uniformly distributed on a semi-infinite interval. Owing to the nature 
of the particular problems to which Jeffreys applies these rules, it happens that P(S \ OH) 
is such that formal difficulties of convergence do not arise in obtaining P(0 | SH), For the 
finite interval, why should one choose the parameter to be uniformly distributed rather than 
the square or some other function of the parameter, or for the sefni-infinite interval (0, oo) 
why should one choose the logarithm of the parameter rather than some other function to be 
uniformly distributed? It is easy to show that the assumption of uniform distribution is, in 
general, inconsistent with that of uniform distribution of any single-valued function of the 
parameter. 

Chapter IV, entitled ‘Approximate Methods and Simplifications’, contains discussions, 
some of them rather heuristic, of the problem of estimation involved in such topics as 
maximum likelihood, least squares, errors due to grouping, rank correlation, contingency, 
artificial randomization, etc. The inverse probability approach is, of course, maintained 
throughout. 

The problem of significance tests is treated in Chapters v and vr, J effreys’ attitude toward 
significance tests follows along the lines of his concept of probability and consists in com¬ 
paring posterior probabilities. More specifically, suppose 6 is a parameter and it is desired to 
test the hypothesis that 6 = 0 on the basis of a given set of observations and an hypothesis H 
regarding the distribution law of S for given 0. Let q denote the hypothesis that 6 = 0, and 
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q denote the hypothesis that 9 has some other value. Jeffreys’ criterion for making the 
significance test is the ratio K of the two posterior probabilities P(q \ SH) and P(q | SH). 
The value of 11 itself, and not the probability integral of K under the null hypothesis q, is 
proposed as the criterion. Expressions for K are found for such problems as contingency, 
comparison of means and variances in samples, consistency of two Poisson parameters, 
correlation—problems which have already been treated by Fisher, Neyinan, Pearson and 
others by other approaches free from inverse probability. In the treatment of all of these 
problems, the author arrives at four principal forms of K, which he proceeds to tabulate in an 
appendix for various values of sample size and for five grades of significance corresponding to 
K — 1,10 - *, 10 _l , 1(H, 10~ 3 . The last two chapters, i.e. vii and viii, are entitled‘Frequency 
Definitions and Direct Methods’, and ‘General Questions’, respectively. These chapters 
are primarily philosophical excursions undertaken in an attempt to show that no existing 
definition of probability avoids the notion of ‘degrees of reasonable belief’, and to justify his 
own approach as well as his attitude toward inverse probability. The discussion is almost 
entirely informal and non-mathematical and as such it must be regarded in the category of 
personal opinion. 

The book lacks strict mathematical rigour in various places, but from the point of view 
of general flow of discussion it is interestingly written. It contains many keenly chosen 
quotations and side remarks charged with a delightfully subtle humour which has cha¬ 
racterized the author in othor books. 

From a scientific point of view it is doubtful that there will be many scholars thoroughly 
familiar with the system of statistical inference initiated by R. A. Fisher and extended by 
J. Neyman, E. S. Pearson, A. Wald and others who will abandon this system in favour of 
the one proposed by Jeffreys in which inverse probability plays the central role. 

PRINCETON S. S, WILKS 

UNIVERSITY 


(ii) A Bibliography of Human Morphology, 1914-1939. By Wilton M. Krogman. 

United States of America: University of Chicago Press; Great Britain and Ireland: 

Cambridge University Press. 1941. Price 18s. 

The title of this volume may mislead to some extent . The 11,000 odd references in it were 
collected to aid physical anthropologists and the work will be of greater value to them than 
to other research workers, such as anatomists and geneticists, who are concerned with human 
morphology. The non-German literature is said to he covered more thoroughly than in the 
second edition of Rudolf Martin’s Lehrbuoh, and there is no other comprehensive bibliography 
of the subject for the period since 1928. It is not claimed that the list is exhaustive, and the 
most stringent selection appears to have been made in the section on blood groups for which 
fuller bibliographies ai'e available. 

G.M.M. 


(ill) A Property of the Distribution of Extremes 

By H. E. DANIELS 
Wool Industries Besearch Association 


If the chance of an observation being less than x is P, then P n is the chance that the greatest 
of a random sample of n is less than x. The constants of the distribution of the greatest of a 

dx 


sample in the important ease when P 


-SI 




V(2ti) 


have been calculated by Tippett (1925) 


for values of n up to 1000, and in a paper in which all possible limiting forms of P" are dis¬ 
cussed, Fisher & Tippett (1928) give limiting formulae from which approximate values of 
the constants are calculated for laTge samples. 



Miscellanea 


195 


In the present note attention is drawn to a curious approximate relation connecting the 
mean M and standard deviation cr of this distribution which holds with high accuracy for 
all values of n. It was arrived at empirically and appears to have no obvious mathematical 
derivation. The formula is M = 2 QQt ^ 


Tho values of 2 cot calculated from Tippett’s figures, are compared in Tablo 1 with 
Tippett’s values of M. The greatest discrepancy, 5 = ill — 2 cot Jttct, over the range of n up to 
1000 occurs at n = 10 where it is no more than about %. For n greater than 1000 tho 
penultimate limiting values given in Table A of Fisber & Tippett’s paper are used in Table 2 
and the discrepancies are again found to be small. Tho most serious is of tho order of 3 % 
when n is 7228, but their Table B suggests that the penultimate limiting values of M and cr 
are probably underestimated and as 2 cot \ita is fairly sensitive to changes of cr in the region 
of or — 0-3, the real discrepancy may perhaps be smaller. 


Table 1 


n 

M 

<7 

2 cot \rtcr 

&■- If—2 cot 

i 



0-0000 


2 



0-5617 


5 

1-1630 


1-1448 


10 



1-5176 

-0-0212 

20 


0-5251 

1-8483 

-0-0192 


2-3.193 

0-4545 

2-3086 

-0-0107 



0-4294 

2-5012 

-0-0064 




2-7448 


500 

3-0367 


3-0407 


M' 

3*2414 

0-3514 

3-2475 



Table 2. Penultimate approteimate values 


n 

M 

<r 

2 cot -|tt cr 

8 ~ M —2 cot 

7228 

3-7697 

0-3039 

3-8661 

0-0964 


4-7719 

0-2499 

4-8311 


264 x10 s 

6-9262 

0-1787 

6-9369 

0-0107 


Fisher and Tippett show that in the ultimate limiting form of the distribution tho mode 
m, mean M and standard deviation cr are related by the formulae 

M = m+yc, cr 1 = ^jr s c s , 

where c — m/(m a + 1) and y = 0-577216 is Euler’s constant. Consequently 






i 1 n 

+ ' yo ~ c - = ,^r 


1-282 


as <r becomes small with increasing n. On the other hand, our approximate relation gives 

4 1-274 


M = 2 cot iffer-- 


7 nr 


The error at n = oo is thus seen to be less than 1 %. 
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(iv) Proof of Relations connected with the Tetrachoric Series and 
its Generalization. 

By M. G. KENDALL 

If a bivariate normal distribution F with variates x x and x % and correlation p is doubly 
dichotomized at x x = h, — k, and 


d 


/’CO /*oo 

- Jj ."- 


it is known that 


d - S P r T r (h)T r (k), 
r=° 

where T f is the rth tetrachoric function, defined by 


T r ( X) = ■ 


(H)» 


(1) 


( 2 ) 


and/(a;) being the function 


e - '** and H t (x) the rth Hermite polynomial defined by 

'f(x) = (-Dyf(x). (3) 


V(2n) 


The purpose of this note is to present a simple proof of this result,* to prove that the 
series of equation (1) is convergent for | p | s? 1 and to generalize the series to the case of the 
multivariate normal distribution.f 
The characteristic function of 

dF = exp {" 2(1^) - 2pXlXi + “®) 

I* co roo 

is, by definition, 2 )= exp {it x x x 4- it 2 x 2 }dF, 

J — CO J — co 

and is easily seen by direct integration to be equal to 

exp { - 4(t? + ipkh + * 2 )}- 


We have then, for all finite t L , i 2 . 


(*) 




«H<i> h) = exp ( - exp (- \t\) S ( -p) 

r=0 f 


/'co I*co 1 /*oo /*oG /*ao /’co 

Now d — I I dF = —— I dxA dx t \ <5(t l ,L)exp{-it l x 1 — it i x 2 }dt 1 dt i . 

JhJk (2*rJh Jk J-aoJ-co 

Substituting for <f>{t v t t ) we have, for the coefficient of ( ~ p) r jr\ in this expression, 

1 /’co i’oo /’o0 /* co 

4^2 j h dXl J k dXl J J exp (-• M) exp (- Jfij) k t [exp {~ it 1 x x — it 2 x 2 } dt x dt v 

and this is the product of two integrals, the first of which is 


=j>r. 


exp ( — \£\) t\ exp {—ity a;,} dt x 


(5) 


( 6 ) 


and the second of which is a similar expression in aJ 2 and t 2 . 

* The expansion (1) appears tp have been given for the first time by G. Mehler, ‘Reihew- 
entvricklung nach Laplaceschen Punotionen hoherer Ordnung’, J. reine cmgew. Math. 66, 161. 
f See Note at end of paper. 
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Now, since j exp (~ $*) t r e~ iix dt ■■ 


d{—ix) r 


exp (— U 2 ) e~ ttx dt 


= ^(2n)i r D r e-^ 

= 2 n(~i) r H,{x)f(x) y 


(6) is equal to 


PCC 


(~i) r dxB r (x)f(x) = (~iYH r ^(h)f(h), 


and thus (6) is equal to 1 ) r H r _ 1 (h)f(h)H r _ 1 (k)f(Jc), 


and hence 


d = 2~H r _ x (h) f(h) H t _ x {k)f(k) • 

T I 


= Zp'T r (h)T r (k), 

the tetrachorio series. 

Now for the eonvergen.ee of the series, consider 


From (6) 


| T r {h) T r (h) | = (r !)-* | H r ^h)m H r _ x (Tc).m- 

I H r -i(h) f(h) | *S ^ | J exp (- |t 2 ) dl 

1 

a? - exp { — W)t r ~ x dt 

n J o 

f-a 

2 a Jr\ 


I r T (h) 7 r (k) | < 


2*-z 


2 r-2 e -(r-» 2 ^!_!^ 
TT^j(2n) e~'r r +i 


*/(27r)r ! ’ 

Thus the tetrachorio series converges for | p ) < 1, though possibly slowly near \p \ — 1, 
Now consider the general multivariate normal distribution. 

dF ~ —^—j oxp - ■i~{2R jj xf+^FR ti x J x h }da; i ...dx n , 


(infR* 


R - l Pu ... p ln 
Pil 1 ••• Pin 


I Pin Pin 

and R jh is the minor of the yth row and fcth column. 
The characteristic function is 


/ CO Poo 

... dF exp (Ritj Xj) dx x ... dx n . 

— co J —co 
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To evaluate this integral make the transformation 

Xj — Xt 
k 

and choose the a’s so as to reduce the second degree terms in g in the exponent to the canonical 
form £g 2 . The remaining terms in t and £ will obviously be linear in each. We may then 
make a further transformation £,' = £■—(linear function of the t’a) and ti\e remaining terms, 
apart- from Ag' 2 , will be a quadratic in the t’s, and equal, say, to 

+ 2£8j i t J t k . 

Tho integration, abolishes the terms in fj and wo find that the characteristic Function is 

proportional to . 

©xp {Ay + ■2SS lk t j i k }. 

Putting all but two of the t’s aero we see bjr comparison with (4) that the terms in t jt t h 
is - \(t) 4 - tl + and thus tho characteristic function is 


exp {— %(Ptf + l 2Zp it tjt k )}. 


( 6 ) 


The generalized tetrachoric expansion can be obtained by an expansion of (9) in terms 
of the p’s and tho application of tire foregoing procedure. For instance, with three variates 
we have 

(-l) r 

oxpl.’f ~P]i c tjtjf) — A'——-—(PuMs+PasMab/hsMsT 
r ! 




and on integration 

/**» f*00 1*03 

d = 

J h\ J h% J hi 


dF 




which will also be found to be convergent. 


[iVc>/«. Since writing this paper, under the impression that the results wore now, I am 
indebted to Dr A. C. Aitken for pointing out that similar results have been given by him in 
lectures for a number of yours. Among published work, reference may be made to: 

(ct) P. 176 of Aitken & Turnbull’s Theory of Canonical Matrices (Blackie, 1931), where 
a more direct method of deriving tho characteristic function of tho multivariate normal 
distribution has be,en given; 

(6) A paper on ‘Fourfold sampling with and without replacement’ by A. C. Aitken & 
H.T. Gonin (1936, Proc. Roy. See. Edinb. 55, 114), where tho tetrachoric expansions are 
discussed and new 1 series associated with the correlated binomial and tho correlated hyper-, 
geometric distributions are derived. 

As the work of tho Edinburgh school on this subject may not, however, be generally 
familiar, tho Editor has suggested that this short paper should be published, together with 
the foregoing references, in order to bring the recent developments before a wider statistical 
audience, m.g.k.] 
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(v) The Cumulants of the Distribution of the Square of a Variate 
By J. B. S. HALDANE, F.R.S. 


The following problem has arisen in several biometric investigations. The cumulants of the 
distribution of x are known, and it is desired to find the cumulants of the distribution of x*. 
As this problem is likely to arise in future, it.seems desirable to give the appropriate trans¬ 
formations for the first few cumulants. 


Let fCj, K,j, x 3 ,... be the cumulants of x. 

Let ji v /4> /A* be the momenta of x 1 about zero. 

Let /i», /i 3 , ... be the moments of a: 2 about its mean. 

Let k[, k' v Kg, ... be the cumulants of a 4 . 

Then /(' is the 2rth moment of x. Those have been given in terms of the cumulants up to 
the 10th, i.e. /4, in the general case by Kendall (1940), and up to the 12th, i.e. /tj> by Haldane 
(.1938) when = 0. We consider the general case first. We have such expressions as 

= k\ + Ck\ x a 4-4x x ^ + 3fCj -I- x.,. 


From these we calculate the moments fi„ and hence the cumulants. The results are: 


xi = x 4 4-x a , 

Kij = 4/c 4 x a 4*2(2xiX s 4~x a ) 4-x 4 , 

Kg = 8 k?(XiX 8 + 3/fa) 4- 4(3/cJ/Cj 4- 12XiX a x s -l- 2 k\) + 2(3/c 1 /c s + 6 x a x 4 + 5 K%) + X 6 , 
k', ~ 12/fi^3^3+ 12/c|) 

4- 16(2x1 x 5 4- 18xjx a x 4 4- I2/C1/C3+ 36/c 1 /c^/c a + 3/fa) 

4- 8(3/c?/c 0 + 18/fi/c a /c 0 4- 32/q/c 3 /c 4 + 18xlx 4 4- 30 x 2 x!) 

4- 8(/Cj k.j 4 - 3x a /Cg -j- 7/Cg /c 5 4* 4/cl) L x 8 . 


( 1 ) 


After tliis the expressions beoonae very heavy. When /Ci = 0, i.e. x has its mean zero, most 
of the terms vanish, and we have 


K'l ~K if 
K 3 ss 2x a ~r Kg, 

/c a 8/Cg4- 2(6x a x 4 4- 5x 4 ) 4- 6, 

k[ = 48/cl 4- 48/c s (3/fg/c 4 4- 5x 4 ) + 8(3/c a /Cg 4- 7k 3 Xj 4 - 4k 4 ) 4- /c a , 

K n as 384/cl + 960x5(x a k 4 4 - 5/cl) + 80( 16x a xl + 28/c a /c 3 K e 4 - 6k1k 8 -b 25xf x 4 ) 

4-2(20/c a /c 8 4- 00/Cg k a 4* f 00/Ci k 3 4- 63/c 4 ) 4“^ro, 
fc'e as 3840/cl -b 90OO/cl(3/c a /c 4 4- lOxf) 4- 4800(2x1/c 8 4- 14/rfj/c s /c 5 4- 8/^x1+ 2r>K i Kpq-{- 3 xj) 
4- 40(30x1 x 8 4- 180/cj x s x, 4- 300x a x 4 x 4 4- 228xfx e 4- 189x s x? 4- 672x,x 4 x 5 4- 132x1) 
4- 4(15XjXi 0 4- 66XgX 9 4- 120X4X3 + 198x s x, 4- 113x1) 4- x la . 


(2) 
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Finally, if a; be symmetrically distributed, so that all its odd cumulants vanish, 

4 - * s , 

4 - 2*i|+*«, 

4 = 8*3 + 12k s) k 1 + k i , 

k [ = 48 * 3 + 144 * 1*4 + 8 ( 3 * a * s + 4 * ij ) + * 8 , 

*; = 384 /fj + 1920 * 3*4 + 160 * 2 ( 3* a * 9 + 8 * 3 ) + 40(* a * 8 + 5 * 4 * 6 ) +* 10 > 

*„ = 3840 *| + 28800 * 3*4 + 90 OO * 3(* S * 8 + 4 * 1 ) + 240 ( 6*|* 8 + 50* 2 * 4 * 6 + 22 * 3 ) 
+ 4 ( 15 * 3 * 48 + 120 * 4 * & + 113 *|)+* u . ' 


( 3 ) 


I have bracketed together terms which are products of the same number of */s. If x is 
a linear funotion of observed numbers in a sample of n, every K n is proportional to x, so the 
terms in brackets, will all be multiples of the same power of n. 


REFERENCES 

Haldane, J. B. S. (1938). ‘The first six moments of for an n.-fold table with n degrees 
of freedom when some expectations are small.’ Biomeirika, 29, 389-91. 

Kendall, M. G. (1940). ‘The derivation of multivariate sampling formulae from uni¬ 
variate formulae by symbolic operation.’ Ann. JSugen., Lond., 10, 392-402. 


(vl) Numerical approximations to the percentage points of 
the x 1 distribution 

By MAXINE MERRINGTON 

The use of two approximate formulae has been suggested for calculating a percentage 
point, the X 1 distribution, corresponding to v degrees of freedom and a probability 

level PA Both formulae involve the use of the standardized normal deviate, y p , corre¬ 
sponding to the value of P chosen. 

(1) R. A. Fisher’s (1926 ) formula: 

= (a) 

which assumes that \K^X“) i 0 normally distributed about *J{‘Zv— 1) with unit standard 
deviation. 

(2) E. B■ Wilson. & M. M. Hilferty’s (1931 ) formula: 

= ~l+yrj$> (b) 

which assumes that (x 2 /^ is normally distributed about 1 — 2/(9v) with a standard devia¬ 
tion of ,/(2 j9v). 

Professor Pearson has suggested that I should prepare a comparative table showing 
for certain v and P the numerical values of : 

(а) calculated from Fisher’s formula, 

(б) calculated from Wilson & Hilferty’s formula, 

( 0 ) the correct values taken from Miss Thompson’s table (pp. *188-9 above). 

* The notation used is that adopted in the paper on pp. 187-91 above. 
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(a) EromR. A. Fisher’s formula. (&) from E. B. Wilson & M. M. Eilferty’s formula, (c) from Miss Thompson’s table. 
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Tbo Cf'iiijias’iKan i:; iijitnvu ux t.lu jin.-Rt-dhig; table/- 1 S'l. o'ili bo noon hluifc: 

(1) For tlkn vts:>^v of ■ ohx.s of r and 1‘ tumr.idermi, Use formula. (/>) it: ixvneiattsvvtly move 
nccumfo tJiftii (it,); ilia i/rontait u-lwokitu value of Mi© orror in about 0-0-i for (b) wLiiio at. bho 
fciiila oi" fho disU-ilyiibioii it. aiummIn oo ovor i-vtO for (ft). 

(2) Foi'iuuJu. ((>) jb nxk'oiru.ty t’.cmuv to hi the noiijiibouchood of fcho two ft% UoiihiH 

(P 0-080 til id 0-060). 

(3) For neither formula do Sho notual r-rroi wiluwi deoronso very mudi m t> hiiti«-.»gs 
but, since iiicroiw-fi .viUi v, the relative oi-ror values do, of course, decrouso. 

RE FEBEM CJSM 

Jj’xaHEii., if. A, (! i).‘38). yitaiii4ii‘‘ii Mcihodn for Rstmarvh Wurksi'u. Edinburgh: Oliver and 
Boyd. 

CLvitvoun, F. (Jit,it!), liioinnlrih .-, -iU. 

Wu-soii. hit. -D, /, Hua'isuTV, AL .rti. ( iOSl). Pro e, Nd. .Arm!. AW., \V\rjh., il'f, (igi. 

” At! ■-■B.rJior fio.'.ipai.-iiioi). of the ;• I ipivi'.,:irr!n.(iotM hrvi lasnu ituulu by F. fhw.’ood (1930), 
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MEDICAL STATISTICS FROM GRAUNT TO FARR 

(Continued*) 

BY MAJOR GREENWOOD 

in. THE STATISTICAL WORK OE GRAUNT 

John Gratint’s contribution to our subject has always been regarded as one 
of the great classics of science. A few have indeed doubted whether so great 
a work could have been achieved by one whose material success was so modest 
and have sought to transfer the glory to Graunt’s highly successful friend Petty. 
This dispute I relegate to an appendix. I assume that Graunt’s published book 
is substantially his own original work. 

The history of the material Graunt used has been written more than once 
and I have nothing to add to Prof. Hull’s story. Graunt had, for a period of more 
than 60 years, arithmetical statements of the numbers of males and females 
christened and buried and of the causes of death (not distinguished by sex) 
under some sixty headings. He had no information as to the ages at death. He 
had no information as to the number or ages of the living population. 

The first act of a scientific statistician is to assess the trustworthiness of his 
data, to criticize his sources. This 1 tedious preliminary to the doing of sums was 
not much to Petty’s taste. Petty, as we have seen, often used different data to 
reach some conclusion, but hardly ever discusses the reliabilities of the several 
data. Other Fellows of our College since Petty’s day have made the same 
mistake. The terrible ‘howler’ committed by Dr William Heberden the younger, 
and detected, not without satisfaction, by Charles Creighton is classical.f But 
that was not a unique instance. Indeed, even trained statisticians sometimes 
confuse names with things. More than one rate of mortality has risen (or fallen) 
only on paper. Graunt made no such mistakes. 

Graunt’s general argument is that many causes of death are ‘but matters of 
sense ’, for instance, whether a child were abortive or stillborn, and that in many 
cases the searchers are 1 able to report the opinion of the physician, who was 
with the patient as they receive the same from the friends of the defunct’. But 
sometimes the searchers will, be wrong and often enough the error will not matter. 

As for consumptions, if the searchers do but truly report (as they may) whether the 
dead corpse were very lean and worn away, it matters not to many of our purposes whether 
the diseases were exactly the same, as physicians define it in their hooks. Moreover, in 
oase a man of seventy-five years old died of a cough (of which had he been free, he might 

* The earlier sections were printed in Biometrika, 32, 101-27. 

t Creighton, History of Epidemics in Britain, 2, 747-8. Heberden supposed (erroneously) that 
‘Griping of the Guts’ of the Bills was Dysentery and had decreased. It was Infantile Diarrhoea 
and had simply been transferred to the rubrio ‘Convulsions’. 
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have possibly lived to ninety) I esteem it little error (as to many of our purposes) if this 
person be in the table of casualties, reckoned among the aged, and not placed under the 
title of coughs (348).* 

No doubt this brutal common sense might set on edge the teeth of some 
Fellows of the College of Physicians even in the seventeenth century, but it was 
one of the qualities which made Graunt a pioneer. Making the best the enemy 
of the good is a sure way to hinder any statistical progress. The scientific purist, 
who will wait for medical statistics until they are nosologically exact, is no wiser 
than Horace’s rustic waiting for the river to flow away. 

Graunt, however, did not accept statements which he had the means of 
testing. Finding in a series of years that of more than a quarter of a million 
deaths only 392 were assigned to the Pox, he did not infer that Syphilis had 
been over-rated as a cause of death. 

Forasmuch as by the ordinary discourse of the world it seems a great part of men have, 
at one time or other, had some species of this disease, 1 wondering why so few died of it, 
especially because I could not take that to be so harmless, whereof so many complained 
very fiercely; upon enquiry, I found that those who died of it out of the hospitals (especially 
that of Kingsland, and the Lock in Southwark)'were returned of ulcers and sores. And in 
brief, I found, that all mentioned to die of the French Pox were returned by the clerics of 
St Giles’ and St Martin’s in the FieLds only, in which places I understood that most of the 
vilest and most miserable houses of uneleanness were: from whence I concluded, that only 
hated persons, and such, whose very noses were eaten off were reported by the searchers 
to have died of this too frequent malady (356). 

In principle, the argument is still valid. 

His next example of criticism is the ease of Pickets, which first appeared in 
the Bills of Mortality in 1634 and then with 14 deaths only, but by 1659 had 
risen to 441. Was Rickets a ‘new disease’ or did an old disease receive, in the 
Bills, a new name? 

To clear this difficulty out of the bills (for I dare venture on no deeper arguments) 
I enquired what other casualty before the year 1634, named in the Bills, was most like the 
rickets; and I found, not only by pretenders to know it, bnt also from other Bills, that 
livergrown was the nearest. For in some years I find livergrown, spleen, and rickets, put 
all together, by reason (as I conceive) of their likeness to each other. Hereupon I added 
the livergrowns of the year 1634, viz. 77, to the rickets of the same year, viz. 14, making 
in all 91; which total, as also the number 77 itself, I compared with the livergrowns of the 
precedent year 1635, viz. 82. All which showed me, that the rickets was a new disease over 
and above. Now, this being but a faint argument, I looked both forwards and backwards, 
and found that in the year 1629, when no rickets appeared there were but 94 livergrowns; 
and in the year 1636 there were 99 livergrowns, although there were also 50 of the rickets: 
only this is not to be denied, that when the rickets grew very numerous (as in the year 1660, 
viz. 621) then there appeared not above 15 of livergrown. In the year 1659 were 441 rickets 
and 8 livergrown; in the year 1658 were 476 rickets and 51 livergrown. Now though it be 
granted that these diseases were confounded in the judgment of the nurses, yet it is most 
certain that the livergrown did never but once, viz. anno 1630, exceed 100; whereas anno 

* Numbers in brackets are page references to Prof. Hull’s edition of The Economic Writings of 
Sir William Petty together with the Observations upon the Bills of Mortality more probably by Captain 
John Graunt, Cambridge, 1899. 



Major Greenwood 205 

1660, livergrown and rickets were 636, It is also to be observed, that the rickets were never 
more numerous than now, and that they are still increasing; for anno 1649, there were but 
190, next year 260, next after that 329 and so forwards, with some little starting backwards 
in some years, until the year 1660, which produced the greatest of all (367-8). 

This is an excellent statistical argument, and, incidentally, evidence that 
Graunt wrote his own hook, for a physician would probably have suggested that 
the professional interest excited by the classical treatise of Glisson (assisted by 
Regimonter) which was published in 1650 might easily have increased the 
popularity of the diagnosis. Petty, who, with Glisson, was a founder of the 
Royal Society, would hardly have ignored his colleague’s work. 

I cannot resist the desire to mention others which, while of little statistical - 
importance, have a medical attraction. Graunt noticed that Stopping of the 
Stomach first appeared in the Bills of 1636, increased from 6 to 29 by 1647, by 
1655 it reached 145, in 1657, 277 and 1660, 314, First he conjectured that 
Stopping of the Stomach might be the Green Sickness, ‘forasmuch as I find few 
or none to have been returned upon that account, although many be visibly 
stained with it’. He thought that possibly Green Sickness might not appear in' 
the Bills ‘for since the world believes that marriage cures it, it may seem indeed 
a shame, that any maid should die uncured, when there are more males than 
females, that is, an overplus of husbands to all that can be wives’. Then he 
wondered whether Stopping of the Stomach might not he Mother, ‘ forasmuch 
I have heard of many troubled with Mother Fits (as they call them) although 
few returned to have died of them’. But he was diverted by guessing ‘rather 
the Rising of the Lights might be it ’. He remembered that some women troubled 
with the Mother fits did complain of a choking in their throats. ‘Now, as I 
understand, it is more conceivable that the Lights or Lungs (which I have heard 
called the bellows of the body) not blowing, that is, neither venting out, nor 
taking in breath, might rather cause such a choking, than that the Mother should 
rise up thither, and do it. For methinks, when a woman is with child, there is 
a greater rising, and yet no such fits at all’ (359). He notes that Rising of the 
Lights increased in the Bills from 44 in 1629 to 249 in 1660. 

Finally, he suggests a correlation between Stopping of the Stomach, Rising 
of the Lights in adults and the Livergrown, Spleen and Rickets of children. 

‘ And that what is the Rickets in children, may be the other in more grown 
bodies; for surely children which recover of the Rickets, may retain somewhat 
to cause what I have imagined: but of this let the learned physicians consider, 
as I presume they have’ (359). 

It might be suggested that one item under Stopping of the Stomach could 
be surgical, viz. strangulated hernia. Rupture was a heading in the Bills, but 
the numbers are small and show no regular increase with the increase of popu¬ 
lation. Graunt’s attraction to what used to be called hysterical stigmata is 
interesting. One wonders how far these passages reflect conversations with 
Petty. It is clear that Graunt had no belief in the peripatetic uterus; Petty 
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would have had none. The best medical opinion of the age is, of course, that of 
Sydenham. Sydenham (whose pathology was traditional) had a pneumatist 
aetiology of Hysteria, the origin was an ataxia of the animal spirit (which was 
the pneuma zotikon of ancient tradition). He not only believed that Hysteria 
might be a serious or even mortal complication of organic disease—as we do 
still—but that the ataxic spirits might themselves produce humoral corruption 
and lead to chlorosis or ovarian dropsy (Dissertatio epistolaris , 92). So there is 
nothing repugnant to the best professional opinion of the age in admitting 
Hysteria to the list of causes of death. Nor is there any gross absurdity in the 
suggested correlation of increasing Rickets and increasing Hysteria, from the 
point of view of a layman. But that surmise does not imply any professional 
hint, it rather suggests a belief in a merely physical factor, the pressure of an 
enlarged organ. That passage would not have been written by a physician. 

These are sufficient instances of Graunt’s criticism of sources—the temptation 
to go on quoting examples must be resisted. I pass to his great achievement, 
the estimation of rates of mortality at ages when the numbers and ages of the 
living were not recorded. For such an estimation to be correct, we all know that 
the population must be stationary, viz. non-increasing, not subject to migration 
and having constant rates of mortality in the several age groups. 

It is a nice point whether Graunt or Petty appreciated the importance of 
these considerations. Graunt was certainly alive to the fact that the population 
of London was growing and that the growth was due to immigration from the 
country. The arithmetical position was this. In the earlier yeax-s of his series 
burials and christenings were about equal in numbers, in 1605 there were 5948 
burials and 6504 christenings; in 1625, 7850 burials and 7682 christenings, in 
1635, 10,651 burials and 10,034 christenings. Later the burials continued to 
increase, but the christenings either decreased or failed to increase in the same 
proportion. This Graunt attributed to neglect of christening owing to religious 
dissidence and gave excellent reasons for his view. It is clear then that there 
were two factors of increase, immigration and increasing numbers of births. 
Most of Graunt’s deductions are based upon an analysis of the deaths by causes 
for twenty years, 1629-36 and 1647-58, which he selected as years comparatively 
unaffected by plague (of his total of 229,250 deaths only 16,000 were from 
plague). 

If we treat this total as a denominator (or one-twentieth of it) it will, from 
the point of view of calculating mortality ratios, be affected by two errors. The 
deaths of immigrants will make it too large and the increasing births will make 
it too small. Can it be Graunt held that the errors balanced so that, arithmetically 
speaking, one might behave as if one were dealing with a stationary population? 
An alternative explanation is that Graunt did not reab'ze the limitations of the 
method. 

A third possibility is that, although he knew the fallacy, he believed that 
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the incorrect method gave an approximation to truth sufficient for his purposes. 
This is the solution I should he inclined to adopt were I forced to choose. 

As I have pointed out above, there is at least a suggestion that Petty did 
have some glimmering of the conditions to he fulfilled if a summation of deaths 
is to give a correct view of rates of mortality. I do not believe that Graunt was 
less informed on any point of vital statistics than Petty. However, all this is 
guess-work. 

Graunt did not know the ages of the dead; what he did was to pick out 
of the list of causes of deaths those which he thought lighted only upon children 
‘not more than four or five years old’. He chose Thrush, Convulsions, Rickets, 
Teeth and Worms, Abortives, Chrysomes, Infants, Livergrown and Overlaid. 
These gave him some 70,000 out of some 229,000. Then he assigned half the 
deaths from Small Pox, Swine Pox, Measles and Worms without Convulsions 
also to children under six and reaches the final conclusion that about ‘ 36 % of 
all quick conceptions die before six years old’. 

Is this conclusion—I will not say correct, because we have no data to reach 
a correct result—but of a reasonable order of magnitude? The answer is that 
it is eminently reasonable. Two hundred years after Graunt’s death, William 
Parr printed (in the famous Supplement to the 35 th Annual Report of the Registrar- 
General, p. cxxxvi) an outline Life Table for London. This was, of course, com¬ 
puted by an approximately correct method, using knowledge of the numbers 
and ages of the living population, and reflects the conditions of seventy-five 
years ago. Interpolating in this we find that about 32% of ‘quick conceptions 
died before six years old 1 . There is no good medical reason for holding that the 
conditions of child life in London in the middle of Victoria’s reign were much 
better than in the seventeenth century. The old genius used a bow with a frayed 
string and made no allowance for windage, but his arrow hit the target not far 
from the white. He gave the first quantitative measure of the Herodian sacrifice 
in towns, a sacrifice which was to continue to be offered for more than. 200 years. 

Graunt then passed to the other end of life and found that 7 % of the dead 
were ‘aged 1 . He conceived that the searchers would mean by ‘aged’ persons of 
70 years or upwards, ‘for no man he said to die properly of Age who is much 
less 1 . His following suggestion that the proportion living beyond 70 might be 
used as a measure of healthfulness is not happy. But this calculation may have 
led him to make, or insert, the most famous passage in his book, viz. what is, 
in form, the first Life Table ever published. 

Whereas we have found, that of 100 quick Conceptions about 36 of them die before 
they be six years old, and that perhaps but one surviveth 76; we having seven decads 
between six and 76, we sought six mean proportional numbers between 64, the remainder, 
living at six years, and the one, which survives 76, and find that the numbers following 
are practically near enough to the truth; for men do not die in exact proportions, nor in 
fractions, from whence arise this Table following (386). 

Graunt’s figures are 100, 64, 40, 25, 16, 10, 6, 3,-1. 
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The one survivor to 76 is, as Graunt implies, a guess; perhaps he conjectured 
that his seven survivors beyond 70 died one a year. How he calculated his mean 
proportional numbers is unknown. Prof. Willcox conjectured that he experi¬ 
mented with multipliers of S/8 and 2/3—the former nearly reproduces the figures 
(see Willcox, Revue tie VInst. Intern, de Statistique, 5 (1937), 327). Ptoukha 
(Gongres Intern, de la Population ; Demographie historique, p. 71, Paris, 1937) 
ingeniously suggests that he used the multiplier (64 —1)/100 or 0-63. 

We must, I fear, conclude sorrowfully that this shot did not find the bull’s 
eye. If Graunt’s survivors are compared with those shown in Halley’s table 
(when correctly used, vide infra), for 100, 64, 40, 25, 16, 10, 6, 3, 1, we should 
have 100, 56, 50, 46, 38, 31, 22, 14, 6, It is possible that child mortality was 
lower in London than in Breslau, but quite incredible that later age mortality 
should have been so enormously higher. 

But, of course, having regard to the data, it would have been more than 
genius, it would have been magic, had a correct result been obtained. 

Prof. Willcox, whose opinion of Graunt is almost as high as mine, regards the 
passage as inserted on the recommendation of Petty and as Petty’s composition. 
He thinks that it lacks Graunt’s caution and suggests the flighty ingenuity of 
his friend. Prof. Willcox’s arguments are weighty, but I am not convinced. 
That Graunt did not—to use the expressive slang— feature his table is true. It is 
also true ( vide supra) that passages in Petty’s undoubted writings imply that 
he had some conception of a survivorship table. But—and this is my main 
difficulty—if this were Petty’s idea, I find it difficult to believe that he would 
not have exploited it. Halley, whose economic scent was not so keen as Petty’s, 
saw the epoch-making importance of an idea which was to transform the business 
of selling annuities. It would be odd if Petty had seen it that he did not comment 
upon it. Graunt might well have hesitated, being a cautious statistician, but 
surely not Petty. 

However, in spite of modern practice, the writing of history wholly in terms 
of psychology has its pitfalls. 

Let us return to simpler applications of shop arithmetic. The advantages of 
country life over town life from the point of view of both mortality and morality 
had been a commonplace of poets, particularly those Roman poets who spent 
much of their lives in a city, long before the seventeenth century. Graunt was 
the first to apply an arithmetical test of mortality; he compared the statistics 
of Romsey with those of London, For Romsey he had ninety years’ data of 
marriage, christenings and burials. 

His statements about the population of the parish are not quite consistent. 
In one sentence he says that it ‘both 90 years ago, and also now, consisted of 
about 2700’, but a few lines later says ‘it neither appears by the burials, 
christenings, or by the built of new housing, that the said parish is more populous 
now, than 90 years ago, by above two or 300 souls’. A little later he says ‘it is 
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clear that the said parish is increased about 300, and it is probable that 3 or four 
hundred more went to London; and it is known that about 400 went to New 
England, the Caribe Islands and Newfoundland within these last forty years ’ 
(389). Actually, from an estimate of the number of communicants (which he 
assumes to be rather more than half the total population) he makes the average 
population between 2700 and 2300. Taking the average of burials for the whole 
period to be 58, this gives him a death rate of a little more than one in 50, 
which he contrasts with the London figure of one in 32 (apparently based on his 
count of 11 families with 88 persons amongst whom 3 deaths occurred in a year; 
but this is a rate of one in 29). 

There is no doubt a certain sketchiness about this, but it was not unreasonable 
to infer that the Romsey rate was much lower than the London rate. 

Graunt found that, unlike London, Romsey had an average excess of 
christenings over burials, they were in the ratio of 5 to 4. He estimates that over 
the period the natural increase was 1059, and, as will be seen from the quotation 
made, he allots about a third of this respectively to London, to the colonies and 
to the parish itself. He argues that supposing the population of all England to 
be fourteen times that of London and other parishes to send one-third of their 
natural increase to London, then the London burials should increase about 
200 per annum ‘and will answer the increase we observe’. 

Here again the argument is reasonable. He goes on to an investigation which 
has been severely criticized. He gives a table of the greatest and least number 
of burials in each of the ten-year periods for which he has data. In each decade 
hut one the maximum is more than twice the minimum. But, he remarks, in 
no decade in the London experience is the largest number of burials twice the 
smallest number (he excludes deaths from plague from his statistics). ‘Which 
shews, that the opener and freer airs are most subject both to the good and bad 
impressions, and that the fumes, steams and stenches of London do so medicate 
and impregnate the air about it, that it becomes capable of little more, as 
if the said fumes rising out of London met with, opposed and jostled back¬ 
wards the influences falling from above, or resisted the incursion of the country 
airs’ (392). 

Prof. Hull shook his head over this passage. ‘ This is an attempt to explain 
by physical conditions the wide range in the observed country death rate which 
is really due to the narrowness of the field—a single market town—under 
investigation. It is perhaps the gravest statistical mistake that can be charged 
against Graunt’ (lxxvii). 

I do not like to leave a hero in the lurch. I must concede that if both Romsey 
and London burials were samples from a Poisson universe, the fact that the 
Poisson parameter for London was at least a hundred times that for Romsey 
would make it incredible that the London range, in terms of the mean or of the 
standard deviation, should he so wide as that for Romsey. But Prof. Hull was 
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wrong in supposing that the wide range in the Romsey rates was due to the 
narrowness of the field of observation in a statistical sense. 

Taking Graunt’s 58 as the ‘expected’ annual deaths then, as 1/58 is small, 
the Poisson distribution is not far from the symmetry of a normal curve, and 
using the results of Tippett and E. S. Pearson, we may conclude that the 
expected range would be 23-45 + 6-073. The observed ranges for the successive 
decades are 32, 48, 78, 23, 65, 39, 121, 91, 52. All but one is greater than the 
expectation and six diverge by more than three times the standard error. 

Something more than small numbers is involved. Still, it must be confessed 
that Graunt did not anticipate the reasoning of James Bernoulli, although an 
intuition of genius may have led him to think that something more than ‘ chance ’ 
had play here. 

Graunt devoted special attention to the demographic influence of the plague. 
In the first place, he remarks that the attribution plague understated the 
mortality due to plague. He infers this from the fact that in plague years burials 
from other causes exceeded the average greatly, ‘from whence we may probably 
suspect, that about 1/4 part more died of the plague than are returned for such’. 
Next he inferred that after a great outburst plague lingered for several years. 

The plague of 1636 lasted twelve years, in eight whereof there died 2000 per annum 
one with another, and never under 300. The which shows that the contagion of the plague 
depends more upon the disposition of the air than upon the effluvia from the bodies of 
men. Which also we prove by the sudden jumps which the plague hath made, leaping in 
one week from 118 to 927; and back again from 993 to 268; and from thence again the very 
next week to 852. The which effects must surely be rather attributed to change of the a.ir, 
than of the constitutions of men’s bodies, otherwise than as this depends upon that (366). 

Finally, he observes that within two years the city was re-peopled; a deduction 
from the time taken for the number of christenings to reach again the level of 
a pre-plague year. 

We may, if we please, smile at Graunt’s epidemiological inference. But it is 
a reasonable inference from the facts when we remember that in Graunt’s day— 
in spite of Fracastorius—contagium was not thought of as contagium vivum, 
but as a mere sympathetic vibration or passing on of something. 

I have, I hope, given an adequate sample of Graunt’s quality, but have not 
mentioned the most famous of all his deductions. Both in London and the 
country, on the average more males were christened than females, but more 
males died young or entered celibate occupations. So we reach this conclusion: 

We have hitherto said, there are more males than females; we say next that the one 
exceed the other by about a thirteenth part. So that although more men die violent deaths 
than women, that is, more are slain in wars, killed by mischance, drowned at sea and die 
by the hand of justice; moreover more men go to colonies and travel into foreign parts 
than women; and lastly, more remain unmarried than of women as fellows of colleges, 
and apprentices above eighteen etc. yet the said thirteenth part difference bringeth the 
business but to such a pass, that every woman may have an husband, without the allowance 
of polygamy (376). 
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The story of how this arithmetical justification of God’s providence attracted 
the attention of Derham, of how Derham’s book fired the enthusiasm of the 
Prussian Army chaplain Johann Peter Siissmilch and of how Siissmilch’s book 
influenced Malthus has been well told by Hull. I should not myself rank this 
section high among Graunt’s researches. From a demographic point of view 
neither judicial hangings nor college fellowships could have had much effect in 
reducing the male excess. 

Even copious quotation fails to convey the spirit of a complete book. I have 
quoted good things, but many more remain. Graunt revealed sundry important 
truths and not the least important was that very imperfect data, if patiently 
considered, will tell us something it is good for us to know. If young medical 
officers going to parts of the empire where organized medical and demographical 
information is at no higher a level than that of seventeenth-century England— 
and there are many such places—were restricted to a single book on statistics, 
I should advise them to take not a modern scientific work, but old John Graunt’s 
Observations. 


Appendix 

Did Graunt write the book published over his name? 

John Graunt and William Petty were, as we have seen, close friends. Graunt, 
as the world judges success, failed and Petty succeeded. But by the judgment 
of scientific men in the seventeenth century, and ever since, the order of intel¬ 
lectual precedence was reversed. From the moment of publication a few 
discerning people perceived the originality and importance of the Observations, 
the same people who, while admiring Petty’s verve, ingenuity and worldly 
success, did not take over-seriously his bright ideas. 

But Graunt was a man of one book. Save a note upon the multiplication of 
carp and the growth of salmon, he published nothing more. Petty went on 
writing, scheming and talking for thirteen years after Graunt’s death. That often 
enough in that period, the Observations were discussed over the wine—as they 
were quoted in Petty’s writings—we may suppose. That Graunt discussed his 
work with Petty both before and after publication we may also take for certain, 
although we have no formal proof of it. The country statistics which Graunt 
first used were from Petty’s native parish and even if we are not disposed—as 
certainly I am not—to give much weight to particular turns of phraseology, still 
there are sufficient verbal oddities in some pages of Graunt’s book to suggest 
Petty’s hand. 

In these circumstances, it would not be very surprising if Petty’s associates, 
particularly those who were not good judges of statistical work, were to conclude 
that Petty’s share in the remarkable achievement of Graunt were greater than 
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appeared. It is not even judging Petty too harshly to suppose that he himself 
might come to share the opinion. There is no evidence that Petty ever did 
explicitly claim the credit. In one list of his writings (one of four), found among 
Petty’s Papers, he did include the Observations, which at least is evidence that 
he thought himself entitled to a share of the credit. We may suppose that if, 
in familiar intercourse, somebody had said ‘Come, confess Sir William, yours 
was the hand that guided the pen of poor John Graunt’, he might not have 
denied it very strenuously. I think I have produced evidence enough that Petty 
did not under-rate his powers and was not conspicuous for delicacy of feeling. 
My guesB would be that long before his death he did come to believe that 
Graunt’s intellectual success was due to his help. 

Whether Petty believed this or not, it is certain that friends and associates 
of Petty began to believe it soon after Graunt’s death, and the belief has been 
entertained by a few people in each generation since. These, with one con¬ 
spicuous exception, have been drawn from Petty’s friends or descendants or 
from literary critics. 

In the seventeenth century, of Petty’s circle, Evelyn, Southwell and Aubrey 
believed or said that Petty wrote or inspired Graunt’s book. Two Fellows of the 
Royal Society, Houghton and Halley, also attributed the book to Petty. The 
only one of the five who was certainly a competent judge of scientific merit was 
Halley. Halley began the memoir which contains his Breslau table with these 
words: 

The contemplation of the mortality of mankind has, besides the moral, its physical and 
political uses, both which have some time since been most judiciously considered by the 
curious Sir William Petty, in his moral and political Observations upon the Bills of Mortality 
of London, owned by Captain John Oraunt. And since in a like treatise on the Bills of 
Mortality of Dublin.... But the deductions from those bills of mortality seemed even to 
their authors (sic) to be defective. (Phil. Trans, no. 190 (1693), p. 696.) 

Since the seventeenth century, there has been unanimity among demo¬ 
graphic statisticians and economists that Petty could not have written Graunt’s 
hook. Halley was quite as good a judge of scientific merit as any of them and 
a contemporary of the canvassed writers; if I were sure that he had read and 
compared Petty’s acknowledged works with the Observations I should prefer his 
opinion to that of other ‘experts’—including, of course, my own. Halley’s 
direct testimony, in the sense of a court of law, would be valueless; he was only 
six years old when the Observations were published and became a Fellow of the 
Royal Society five years after the death of Graunt. There is no evidence that 
either before or after the period of writing and publishing his famous memoir, 
Halley worked on demography. After his memoir, but in his lifetime, a new 
epoch in mathematical vital statistics began. De Moivre, eleven years younger 
than Halley, brought out his principal works in the lifetime of Halley (1656- 
1742) and used Halley’s table. The two men must have been well acquainted, 
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for both were enthusiastic disciples and intimate friends of Newton, but Halley, 
like Graunt, made only one contribution to the literature of demography. 

So it may be doubted whether Halley were sufficiently interested in demo¬ 
graphic or economic writings to have read Petty's tracts at all. Also in the 
passage cited above (apart from the -writing of ‘authors’ not ‘author’) the collo¬ 
cation of the Observations on the Dublin Bills with those on the London Bills is 
curious. There is no doubt that the Observations on the Dublin Bills were the 
work of Petty, and in the first edition of them they are stated on the title-page 
to be by the ‘Observator on the London Bills of Mortality’. But this, as Prof. 
Hull pointed out (xlii), was probably a catch-penny device of the publisher, 
Mark Pardoe, to draw a public which had just taken a fifth edition of the London 
Observations. Actually the book did not sell, and when the publisher reissued an 
enlarged version, Petty’s name appeared on the title-page without any reference 
to the London Observations. I conclude that Halley’s evidence is less weighty 
than it seemed. He may well have had before him copies of Graunt’s book and 
of the two editions of the Dublin Observations. Having no other knowledge of 
the literature he would naturally enough write as he did, 

If we eliminate Halley, no other expert countenanced Petty’s authorship and 
one, Augustus de Morgan, gave an amusing but quite cogent reason for dis¬ 
missing the notion. 

In speaking of the variations in the annual numbers of deaths attributed to 
Rickets, Graunt said: 

Now, such back-starting seem to be universal in all things: for we do not only see in the 
progressive motion of wheels of watches, and in the rowing of boats, that there is a little 
starting or jerking backwards between every step forwards, but also (if I am not much 
deceived) there appeared the like in the motion of the moon, which in the long telescopes 
at Gresham College one may sensibly discern (358). 

De Morgan ( Budget of Paradoxes, 68; Assurance Magazine, 8,167) commented 
on the improbability that ‘that excellent machinist, Sir William Petty, who 
passed his day among the astronomers’ would attribute to the motion of the 
moon in her orbit all the tremors which she gets from a shaky telescope. 

Down to 1927 the matter was regarded, in scientific circles, as settled. In 
that year the late Marquis of Lansdowne published a copious selection of the 
Petty Papers with what he regarded as new evidence in favour of Petty. 

The only new evidence of a direct kind was a manuscript list in Petty’s hand 
of his writings or projected writings which included the Observations. There are 
three other lists which do not include the Observations, and if we are to suppose 
that the entry really referred to the book published under Graunt’s name, then 
we must believe that in 1685 and in 1686 Petty had forgotten his best title to 
scientific immortality. The remainder of the evidence consists of parallel passages 
and ad captandum arguments to the effect that it was more probable that a 
physician had written on questions of medical statistics than a tradesman. This 
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publication led to a lively controversy. Of the merits of this, I, as a party to it, 
am not an impartial judge. Purely literary arguments do not appeal to me when 
the question is of scientific method. Thus, Dr L. F. Powell attached Weight to 
the fact that Dr Johnson in conversation had attributed to Petty an observation 
(not statistical) which is made not in Petty’s writings but in Graunt’s book. 

In the discussion the word 1 style ’ is used in different senses by the combatants. 
The statisticians are thinking of scientific method, the literary critics of verbal 
arrangement. To the former the fact that, particularly in the conclusions and 
the Appendix, Graunt’s book has turns of phraseology which suggest Petty’s 
hand, seems of little importance. To the latter it seems very significant. 

In the article by Prof. Willcox, which I have quoted above, the controversy 
is reviewed, and the author concurs generally with his statistical predecessors. 

Prof. Willcox does, however, differ from his predecessors in one important 
particular. He holds that the famous life table was supplied by Petty. He argues 
that this is far too conjectural to have been the work of so cautious a reasoner 
as Graunt: 

In attempting to reconstruct its origin I have surmised that after Graunt had estimated 
that 36 per cent, of the deaths were due to children’s diseases, that they all occurred under 
the age of six, and that the seven per cent, who were reported to have died ‘aged’ died at 
over 70 years of age (at one place he says over sixty), he felt unable to go further and 
reported his difficulty to Petty, already perhaps speculating about a series of similar 
problems. 

Petty guessed at the number of survivors at the end of each decennial age period, 
6-16, 16-26 etc. incidentally and characteristically ignoring Graunt’s theory that seven 
per cent, survived seventy, andassuming instead, without reason, that one per cent, survived 
seventy-six and not ono per cent, eighty-six, and that the survivors at age six decreased with 
each age period in a geometrical progression approximately equal to the 64 per cent, which 
Graunt had set for the first group (326-7). 

Prof. Willcox’s argument is cogent. It may be strengthened by a criticism 
of the late Prof. Westergaard (Contributions to the History of Statistics (London, 
1932), p. 23). In using this table, Graunt made a serious blunder. In order to 
estimate the number of men of military age in London he subtracted the number 
alive at age 56 from the number alive at age 16. But this simply gives him the 
number dying between those ages; what he wanted was some average of the l x ’s. It 
is evident that Graunt was not at all clear in his mind as to how to use a life table. 

On the other hand, if this table were really Petty’s idea, it is hard to under¬ 
stand why he did not exploit it. If Petty had been a Halley, the explanation 
would be obvious. The table is wrong; the conditions for the validity of the 
method were not fulfilled. There is indeed (vide supra) some evidence that Petty 
did know what data were necessary in order to construct a proper life table. 
One seems on the horns of a dilemma. If Petty thought the table was correct 
why did he make no further use of the method? If he thought it was wrong, 
would he have urged Graunt to insert it ? 
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Although Prof. Willcox has certainly shaken my previous conviction, I still 
feel reluctant to surrender Graunt’s table to Petty. However, there may be an 
element of sentimentality in this. At least the statisticians agree that the answer 
to the question which I have placed at the-head of this Appendix is emphatically 
yes. 

References to the recent controversy in chronological order 

1927. Lansdowne, Marquis of. The Petty Papers. 

1928. Greenwood, M. J.R. Statist. Soe. 91, 79. 

1928. Lansdowne, Marquis of. The Petty Southwell Correspondence, pp. xxiii—xxxii. 

1932. Lansdowne, Marquis of. The Times Literary Supplement, 8 Sept. 

1932. Bbett-James, N. G. The Times Literary Supplement, 15 Sept. 

1932. Greenwood, M. The Times Literary Supplement, 22 Sept, 

1932. Lansdowne, Marquis of. The Times Literary Supplement, 13 Oct. 

1932. Powell, L. F. The Times Literary Supplement, 20 Oct. 

1933. Gbeenwood, M. J. R. Statist. Soo. 96, 76. 

1937. Willcox, W. F. Revue de Vlnst. Intern, de Statistique, 5, 321. 

IV. HALLEY’S LIFE TABLE 

The long and fruitful life of Edmund Halley (1656-1742) belongs to the 
general history of science; of him it may indeed be said nihil quod tetigit non 
ornavit. He made only one contribution to our subject, but it was of first-rate 
importance. 

The circumstances of this undertaking are obscure; Halley would have 
perceived the imperfections of Graunt’s life table, but it is not known whether 
it was he who set on foot a search for better statistical material than Graunt 
had had. Inquiries were however made, and made after he had become a Fellow 
of the Royal Society, so it is at least possible that Halley, who had travelled 
extensively in Europe (he was at Danzig in 1679 and in Italy in 1681), suggested 
that something might be found abroad. By 1691, the'King’s Librarian, Henry 
Justel, who was in touch with the Society, had been thought into communication, 
possibly through Leibniz, with Caspar Neumann, i scientifically minded evan¬ 
gelical pastor of Breslau. Neumann supplied the ,4ata which Halley used. 

In 1883, J. Graetzer, a medical-statistical official of Breslau, published a 
little monograph* which throws light upon the w6rk. He not only extracted from 
the Breslau archives all the data which were oy might have been communicated 
to the Royal Society but had the Society’s Archives searched, with the. result 
that a letter from Neumann to Justel and /mother from Neumann to Halley, 
both with statistical appendices, were discovered. Thanks to the labours of 
Graetzer and an essay by R. Bdckh (Buljeiin de Vlnst. Intern, de Statistique, 7 
(1893), 1) we can form a reasonably clef! idea of Halley’s method, which was 
not what those who have not examined the literature suppose it to have been. 

It is often stated that Halley, having found that during the five years of 
observation the number of births onlypghtly exceeded the number of burials, 
* Graetzer, J., Edmund Hatey and Caspar Neumann, 1883. 
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treated the population as stationary and constructed a life table by a simple 
summation of the deaths in the manner already explained. He was much wiser. 
What he tried to do was to construct a population table, in the following way. 
Suppose we know how many children were born in a calendar year, say 1690, 
in a town not subject to migration which maintains accurate registers of ages 
at death, and then we discover how many of the children born in 1690 will be 
alive on each successive first of January by a series of subtractions. We shall 
have the survivors on 1 January 1691 by subtracting those of the children horn 
in 1690 who died in 1690. We shall have the survivors on 1 January 1692 by 
subtracting the deaths in 1691 occurring among the survivors to 1 January 1691, 
and so on. This will give a precise enumeration of the living population, and this 
is what Halley wanted. The figures we shall obtain will not be the conventional 
Vs of a Life or (as German writers say) Mortality Table, but what in most 
modern books are represented by the capital letter L or years of life lived or 
“persons” living between the termini (see Appendix). If the population is 
stationary, the sum of these figures gives the population of the place under 
study. Now for ages between 1 (last birthday) and very advanced ages, L x is 
simply \ x diminished by | (1^- l a+1 ). In the first year of life (and at advanced 
ages) the difference is greater. Thus in the first year of life deaths are not evenly 
distributed throughout the year of life, more than 70% of them occur in the 
first six months-, of life, so that instead of subtracting half the deaths we must 
subtract nearly three-quarters. Halley himself assigned 68 % of deaths in the 
first year of life to th,e first half of the first year. The reason why Halley proceeded 
in this way was that he knew the population not to be stationary. His idea was 
to obtain the figures i’br the first few years of life accurately—indeed just as 
they are now obtained-v-and then to correot for excess of births over deaths. 

His masterly plan was partly defeated by the fact that his Breslau corre¬ 
spondent Neumann was n# so good a statistician as he was. Halley’s letter to 
Neumann has not been preserved, we have only Neumann’s answer of 1 March 
1694. Probably Halley askeckNeumann to send him (as a check on the calcula¬ 
tions he had already made) th e exact numbers of survivors on 1 January for 
five years, of births in a calendar year. Neumann did send him a table, hut the 
table, as Graetzer pointed out, i\ wrong. Neumann gave the correct figures for 
1 January of the first suocessive\ year, incorrect figures for the other years. 
Between 1 January of the year following the year the births of which are under 
study, and the next first of January ,\some of those bom in the starting year will 
die under and some over a year of ag\ e - Neumann merely deducted the former, 
so he has too many survivors. To reachi^he right figure would have meant taking 
more trouble and he did not appreciate! the importance of this. Bookh—whose 
opinion of his statistical contemporaries |has a tinge of bitterness rare, of course, 
in other scientific pursuits—remarks thJU it was not strange Neumann should 
miss the point as it had been missed by |many statisticians long after his time. 
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s at least clear that Halley had realized an important truth which did not 
ome part of even expert knowledge for more than a century. 

The precise arithmetical details of Halley’s work are not perhaps of much 
iical interest. Graetzer and Bockh have done a good deal to clear it up. The 
a used (an average of five years) had 1238 births and 1174 deaths and the 
le accounts for 1238 deaths. Halley must therefore have had a plan for 
.■easing deaths. It is likely, from an observation he makes on mortality in 
ist’s Hospital, that he did not wholly depend on the Breslau figures. Graetzer 
gests that Halley may have made two graphs, one having an ordinate of 1238 
;he origin and an ordinate of 64 at the oldest age, the other an ordinate of 
4 at the origin and 0 at the oldest age and that he plotted the survivors for 
h graph based on recorded deaths and drew a curve passing through 1238 
L 0 between these graphs. It may be so. Using the original material which 
ictzer published, Bockh recalculated the table. The results do not, except at 
s over 60, differ materially from Halley’s. So far as concerns the mean after 
time (expectation of life), Halley’s table gives 27-54 years at birth, Graetzer’s 
terial 27-69. For ages under 40 the re-working gives slightly lower and for 
;r ages higher mortality. It may be noted that Halley’s table gives appre- 
fiy higher mortality in childhood than Graunt’s, more than 43 % instead of 
Z 0 are dead by the age of six years. But Graunt’s method would exaggerate 
rtality (so would Halley’s method, but, owing to his precautions, not so 
atly). On the other hand, Graunt’s estimate of age is only an intelligent 
as. Actually, as Graetzer showed, the infant and child mortality shown by 
lley’s table differed little from the observed rates of mortality in the city of 
slau in 1876-80. 

It has been said that Halley was not greatly interested in the medical aspects 
lis work. After describing methods of calculating the prices of annuities, he 
the following passage*: 

ft may be objected that the different salubrity of places does hinder this proposal from 
ig universal; nor can it be denied. But by the number that die being 1,174per annum 
14,000 it does appear that about a 30th. part die yearly aa Sir William Petty has 
iputed for London; and the number that die in infancy is a good argument that the air 
at indifferently salubrious. So that by what I can learn, there cannot perhaps be one 
;er place proposed for a standard. At least ’tis desired, that in imitation hereof the 
ous in other cities would attempt something of the same nature, than which nothing 
laps can be more useful. 

That the mortality of childhood depends upon the atmosphere is not so 
iish an hypothesis as it may seem to us. Halley lived before breast-feeding 
ame the exception rather than the rule. The ‘ curious ’ in other cities had not 
wit to follow his advice. He made no other contribution to the science of 
il statistics; a gain to astronomy but a heavy loss to demography. 

* I have read Halley’s paper in the collection of papers, many by him, collected under 
title Miscellanea Guriosa, printed in London in 1705 ; the quotation is from p. 300. 
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Appendix 

Halley’s table is printed in two columns, the first headed 1 Age current’, the 
second ‘ Persons ’. Thus: 

Age current 

1 

2 

3 

4 

5 

8 

7 

8 

9 

10 

and so on. 

A m istake sometimes made is to suppose that Halley meant by age current 
simply the end of each year of life and that the entry against each 1 age current ’ 
is the number of survivors at exact age one year less than the ‘age current’, 
viz. that of 1000 born 855 survived to the first anniversary, 798 to the second 
anniversary, etc. The fact that Halley uses the round number 1000 for a first 
entry does something to encourage the mistake among readers who have not 
consulted the original paper and it is sometimes made by people who should 
know better. It is actually a terrible ‘howler’, leading to a wholly false view of 
rates of mortality in early life. Thus if 1000 and 855 were really the first two 
entries of a Life Table as set out now, then, as the first two entries in Engb'sh 
Life Table no. 7 Males (mortality 1901-10) are 1000 and 856, we might conclude 
that mortality in the first year of life was no lower in 1901-10 than in Breslau 
in the last years of the seventeenth century. But the 1000 of Halley’s table is 
•not the number of new-born children hut the average number out of 1238 born 
living between the ages of 0 and 1. This is what is called the L x of a modern 
table or the population living between the ages x and x+ 1. If we have a column 
of Lfs, which is what Halley gives us, we can deduce therefrom the more 
familiar Vs provided we know the starting value and the number of deaths in 
the first year of life. Halley gives both items. He says that of 1238 annual births 
348 die annually. So that his 1 0 is not 1000 but 1238, and his l x is 890, He chose 
100Q for L 0 by assuming that of the 348 deaths in the first year of life 238 occurred 
in the first six months of life, 68 %. This differs very little from the modem 
practice; in Life Table no. 7 quoted above 73-5 % of the deaths in the first year 
of life are assigned to the first six months of life. Having been given 1 0 and l x 
we can deduce the other l’s from the values of the L’s which Halley gives 
because, after the end of the first year of life there is little error in supposing 
that the deaths between two birthdays are evenly distributed over the year; 
so, for instance, 1 2 will be equal to L x less half.the difference between l x and 1 3 , 


Persons 

1000 

855 

798 

760 

732 

710 

692 

680 

670 

661 
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and proceeding in this way we put Halley’s table into modern form. I attach 
a table calculated by Bockh. 

It will be seen that, if Halley’s table is properly used, the comparison is not 
of 1000 and 855 with 1000 and 856 but of 1000 and 719 with 1000 and 856. 

Actually this is still slightly optimistic, because I am comparing ‘persons’ 
with males. The ‘persons’ figure for 1901-10 is 1000, 869. On the other hand, 
in the Breslau data still births (or some of them) are included in births, so that 
the mortality is slightly exaggerated. If for instance 7% are still born, the 
survivors to 1 will be the same, but the 1„ should be reduced to 930. Or alter-' 
natively we should write 1000 and 773. 

I attach Halley’s table reduced to modern form and with the corresponding 
expectations of life calculated by Bockh (I have reworked some of the values 
from the data and agree with Bockh’s figures). 


Halley’s Table , expressed in modern form together with the. Expectations of Life 
at quinquennial intervals (Bockh) 


Age 

1* 

4 

Age 

1* 

4 

0 

10,000 

27-54 

40 

3,557 

22-05 

5 

5,816 

41-47 

45 

3,167 

19-47 

10 

5,307 

40-25 

50 

2,751 

17-05 

15 

5,049 

37-19 

55 

2,319 

14-75 

20 

4,806 

33-93 

60 

1,914 

12-33 

25 

4,552 

30-69 

05 

1,511 

9-96 

30 

4,257 

27-64 

70 

1,103 

7-74 

35 

3,921 

24-78 

75 

670 

7-50 


V. GUESSING THE POPULATION 


My object is to trace the growth in our country of that part of statistical 
science which is of interest to students of medicine or public health. In speaking 
of such pioneers as Graunt, Petty and Halley it was proper to construe the 
obligation rather freely. Both Graunt and Petty did clearly perceive the relevance 
of their researches to matters of public health or even clinical medicine, but much 
of what Petty did had a more direct bearing upon political questions than those 
of public health. Again, the life table is a way of expressing the facts of mortality 
which is valuable in some medical researches, but its importance as a statistical 
instrument has been much greater in non-medical than medical circles, above 
all of course in the financing of assurance business. The commercial importance 
of life tables was perceived by Halley and by other mathematicians of his and 
the following generations. 

Looking at the position after Halley’s publication it was clear that progress 
might be made (1) in improving the accuracy of the life table, viz. by obtaining 
data more relevant to the conditions of life of persons who assured their lives 
or bought annuities, (2) in simplifying the very laborious calculations which the 

Biometrika xxxn 15 
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determination of praemia or purchase values required. Under (1) no progress 
worth, speaking of was made in England until the end of the eighteenth century. 
This was partly due, as we shall see, to a not entirely unjustified disbelief in the 
powers of the medical profession to change the rate of mortality, partly to 
ignorance. No first-rate English mathematician after Halley gave any critical 
attention to the theory of the Life Table before the nineteenth century. Under 
(2) considerable progress was made, but this progress is of little or no medical 
interest and to describe it would involve entering upon tedious arithmetical and 
algebraical detail. The primary medical-statistical quaesita are correct enumera¬ 
tions of deaths by sex at ages and by causes, and of the numbers living in sex 
and age groups. When these have been satisfied, the medical statistician can get 
to work. 

For 150 years after Graunt’s death very little was done to improve matters. 
Down to 1801 the population as a whole had not been counted; forty years more 
passed before a reasonable age distribution was secured, and it was thirty-eight 
years after the first denominator (populations) that the first numerator (deaths) 
of the fundamental fractions was, obtained-. Until 1801 intelligent guessing \yas 
the method and the guesses of the eighteenth century deserve a few pages, if 
only because they prove that statistical ability is as rare as other kinds of ability 
and that wishful thinking is not a modern foible. 

The first estimator to mention belongs to the seventeenth century and was 
a younger contemporary of Graunt and Petty, Gregory King (1648-1712). He 
was bom in Lichfield, the son of a land surveyor. At the age of fourteen he was 
recommended as a clerk to the famous herald Dugdale with whom he worked 
for several years; after Dugdale had finished his Visitation, King worked for 
various amateur antiquaries and was eventually invited by a lady of property 
in Sandon (Staffordshire) to be her steward, auditor and secretary. Here he 
remained until 1672 when he moved to London and, no doubt through Dugdale’s 
recommendation, had a considerable amount of employment in both heraldic 
work and ordinary surveying. In 1677 he became a member of the College of 
Arms, in which he attained the rank of Lancaster Herald and so continued until 
his death, but worked for other official bodies on financial subjects. 

The decorous memoir by George Chalmers, from which I have extracted 
these particulars, does not give us a very life-like picture of the man himself. 
There is a certain likeness between the early careers of Petty and King. King 
was not indeed shipped as a cabin boy, but Mr King (the elder) drank (if we may 
venture so coarse an abbreviation of Chalmers’s statement that the father studied 
and practised his profession ‘with more attention to good fellowship than 
mathematical studies generally allow’) and King junior was a pupil teacher at 
eleven. If he really read Hesiod and Homer, made Greek verses and taught 
. himself to survey land in his thirteenth year he must have had Petty’s precooity. 
Both Petty and King had experience of practical surveying and, of course, both 
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were interested in political arithmetic. But there the parallel ends. King was 
a professional surveyor and archivist and had a reasonably successful professional 
career. Petty was—Petty. One might, perhaps, adduce as another parallel that 
King made some enemies and thought himself ill-used. But the job by which 
Sir John Vanbrugh, a stranger to the College of Arms, was made a king-at-armB 
over the head of an official of twenty years’ standing would have galled the 
meekest of mankind. One may safely conclude that King had more knowledge 
of the data of political arithmetic than Petty and less originality. His vital 
statistical work was not published until nearly a century after his death, as an 
appendix to the second edition of An Estimate of the Comparative Strength of 
Great Britain by George Chalmers, London, 1803. Perhaps he never intended 
to publish it—he communicated the substance to his contemporary Davenant— 
and this may explain why there are no details of how some of his results were 
reached. The report reads rather like a document prepared for official use by 
persons interested in results not methods. 

The starting-point of King’s attempt to estimate the population was a return 
from the Hearth Office of the number of houses assessed to tax on Lady Day, 
1690. That was 1,319,216 which, King estimated, had increased to 1,326,000 by 
1696. He deducted 30,000 for empty divided houses,* took the round figure of 
1,300,000 and assigned 105,000 to the London area, 196,000 to other cities and 
market towns and 1,000,000 to villages and hamlets. He used a series of multi¬ 
pliers, 6-4 for a house within the walls of London, 4-6 for a house within the 
liberties, 4-4 for the out parishes in Surrey and Middlesex and 4- 3 for Westminster. 
For other towns, his multiplier was 4-3 and for villages 4-0. 

Having performed his multiplications he gives London a bonus of 10%, 
other towns 2% and villages 1%. Lastly he estimates homeless people to 
number 80,000. The final result to the nearest round number is 6| millions. 

How King obtained his multiplier is not clear. In addition to the Hearth 
Office data he says he used ‘the assessments on marriages, births, and burials, 
parish registers and other public accounts ’ and that from these he deduced the 
multipliers, but this is rather vague. He also classified the population by sex, 
civil state and age (under 1, under 6, under 10, under 16, above 16, above 21, 
above 26, above 60). How he reached these figures is not explained. 

But nothing succeeds like success. As we shall see, his estimate of the total 

* Prof; E. 0. K. Gonner (J.M. Statist. Soc. 76, (1912-13), 261-97), in an interesting paper which 
I have largely used in writing this chapter, remarks that the ‘houses’ of the Hearth Office must 
have been really families or separate occupations as King indeed realized, and thinks that King 
fell into some confusion in attempting to replace families by houses. Gonner argued that the best 
way was to proceed on the basis that the Hearth Office unit of a family should be retained and 
be corrected for empty houses, blacksmiths' shops, etc. on the basis of 1801 census returns and the 
multiplier used should be persons per family of 1801. The result is to give a figure about a quarter 
of a million larger than King’s. The method described in the text also leads to the conclusion that 
King somewhat understated the population. 
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population is probably very near the truth and Prof. Westergaard has remarked 
that, judging from Swedish observations of a few years later, King’s age distri¬ 
bution is quite reasonable. 

As a statistical prophet King was no more successful than his contemporaries 
(and successors). He believed that down to his time the population of England 
had doubled in 435 years, that the next doubling would require from 1200 to 
1300 years and that in a.i). 3500-3600 the population would reach 22 millions 
of souls, in case, as he cautiously adds ‘the world should last so long’. His 
estimates as a matter of arithmetical curiosity are excellently fitted by a logistic 
with an upper asymptote of fifteen millions and would give the present popula¬ 
tion as about eight millions. 

Modern statisticians, such as Parr and Brownlee, have confirmed King’s 
estimate of the population at the end of the seventeenth century in the following 
way. After 1801 the population was known by actual counting and for the first 
forty years of the nineteenth century baptisms and burials were still the only 
data of births and deaths. If one started from, say, the enumeration of 1831 
and worked back to the population of 1821 by adding the numbers of burials 
and subtracting the number of baptisms then, if these really measured deaths 
and births, the result ought to agree with the census enumeration, provided 
immigration and emigration balanced. But the burials and baptisms under¬ 
stated deaths and births. One might adjust the figures by multipliers to bring 
the result into agreement with the census and then test against another backward 
run of ten years. Brownlee found that if the number of burials were multiplied 
by 1-2 and the number of baptisms by 1-243, the agreement was good. 

This may seem a highly conjectural method, but it certainly gives quite good 
results. The difference between births and deaths estimated in this way for the 
decennium 1801-10, I find to be about 12'4 per 1000 living. If one multiplies 
the enumerated population of 1801 by (1-0124) 10 we reach 10-1 millions, not a 
bad approximation to 10-2 millions actually counted. Assuming that before 
1801 burials and baptisms had the same relation to deaths and births as between 
1801 and 1841, we can work backwards to the beginning of the eighteenth 
century with the result that the population then was about 5-8 millions, not 
much more than King’s estimate. In view of the following discussion it will be 
useful to consider the probable state of the population (as determined by these 
methods) in the eighteenth century. In the first sixty years of the century it 
grew very slowly, was about 6-1 millions in 1761 and 6-5 millions in 1761. It 
then began to increase faster, was 7-5 in 1781, 8-2 by 1791 and 9-2 at the census 
of 1801 (8-9 as enumerated, but an estimate of a deficit of l/30th was made).. 

Prom Gregory King’s time to the census of 1801 we have a series of more or 
less intelligent guesses. 

These are well described in Prof. Gonner’s paper.* Two schools of thought 
* J.R. Statist. Soc. 76 (1912-13), 261-96. 
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did battle in the eighteenth century; the pessimists who held that the population 
was decreasing and the country going steadily to the dogs, and the optimists 
who believed just the contrary. Both used the same weapons. The heavy artillery 
was a return of houses for taxation purposes increased conjeeturally by a figure 
for houses which escaped taxation, the sum multiplied by a conjectural average 
of persons per house. As light artillery one had the yield of taxes on commodities 
and the returns of baptisms and burials. 

A pessimist put the number of untaxed houses low and the multiplier low, 
and an optimist raised both. 

The first controversy which took place in 1754 in the proceedings of the Royal 
Society did not attract much notice, Brackenridge (mildly pessimistic) pointed 
out that the number of houses assessed to house tax had decreased between 1710 
and 1754 from 729,048 to 690,000, which suggested a decrease of population 
(by a previous conjectural calculation based on burials and baptisms, he had 
reckoned a small increase, which was probably correct). Much turned on the 
number of houses which did not pay tax (either because the occupant was in 
receipt of alms, did not, owing to poverty, contribute to the church or poor rate, 
or through mere default). Brackenridge put the number at 200,000. His critic, 
Forster, argued that Brackenridge under-stated the number of untaxed houses, 
adducing a sample of nine country parishes with 588 houses of which only 177 
were taxed and a market town with 229 taxed houses out of 448. Using these 
figures as a basis for conjecture Forster raises Brackenridge’s 890,000to 1,427,110. 
From this (with a multiplier of 6 for town houses and 5 for country houses) he 
reaches a population of seven and a half millions—probably a considerable over¬ 
estimate. 

The next controversy was a quarter of a century later (in a period when the 
population was certainly increasing) and its originator was Dr Richard Price 
(1723-91), who has attained a posthumous celebrity reminiscent of the man 
whose title to distinction was that he had once been kicked by George IV. Most 
readers know him as the preacher of a sermon which was the text of Burke’s 
Reflections, most students of economic history know him as the inventor of that 
theory of the virtue of a Sinking Fund which has been likened to the economic 
system of a community which prospered by taking in one another’s washing; 
most vital statisticians remember him as the computor of the Northampton 
Life Table which gave a seriously incorrect picture of prevailing mor tality and in¬ 
directly cost the country a large sum of money. Finally, in the controversy about 
to be described, Price was pertinaciously in the wrong on all the main issues. 

The apparent inference from all this is that Price was either a fool or a knave. 
Gainsborough’s portrait of the Rev. Richard Price, which hangs (or did hang) 
in the Board Room of the Equitable Assurance Society, gives no support to the 
hypothesis that Price was a fool; his life would be a promising field of research 
for a young historian with a competent knowledge of economics. His importance 
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in statistical history is not great enough to justify me in a critical study (even 
if I had the necessary training in finance and economics). My guess is that Price 
was an able, self-confident, original-minded man, who knew a good deal about 
many things and had no exact knowledge of anything. He had ! a way’ with 
him, he could interest people. In fact he had some of the qualities of Petty. It is 
easy enough to make jokes about his notion of the mysterious power of money 
to increase at compound interest and it is possible that William Pitt the younger 
(who was only a hoy when he adopted Price’s theory) was not a good economic 
reasoner. Still, even 150 years ago, there were bankers and Treasury officials, 
and it is possible that both they (and Price) were not so much bad theoretical 
reasoners as shrewd opportunists, that they were deliberately blind to the 
speoiousness of an attractive defence of a desirable financial expedient. I have 
myself sometimes wondered whether, in the eighteenth century, an Assurance 
Society would have minded very much if a Life Table had erred on the pessimistic 
side. 

Price did not enter on the population question with an unbiased mind. He 
was a keen politician and he believed that the policy of the government was bad 
■ for the country; he also believed that the wealth of a country was its people. 
Hence he believed that the population was declining and nothing shook that 
belief. Had he survived another ten years, until the first census, he would 
probably have disputed the accuracy of the returns. 

Price began with the figures of houses in 1690, which he cited from Davenant 
(they were really due to King, who communicated them to Davenant), making 
the total 1,319,000. He then gave the figures of assessed, chargeable and cottages 
(cottages being houses too small to be taxed) as 678,915, 25,628 and 276,149, 
making a total of 980,692 in 1761. In 1777 they were 682,077, 19,396 and 
251,261, a total of 952,734. On this basis he concluded that the population had 
deolined by about one and a half millions and was actually less than five millions. 

Howlett and Wales, Price's chief opponents, impugned every step in the 
reasoning. First, they pointed out that in the estimate for 1690 there was almost 
certainly a confusion between families and houses. Then they argued that many 
householders evaded duty (for instance by the simple plan of blocking up 
windows (the prayer ‘Lighten our darkness, we beseech thee, Oh Pitt’ is still 
remembered) and showed by direct enumeration in certain parishes that the 
returns were inaccurate. Finally, they gave reason to think that Price’s multi¬ 
plier was too small. On each of these points they were probably right. Indeed 
Price was obliged to admit the validity of some of their criticisms. But he 
declined to budge; sometimes he took ad captandum advantage of arithmetical 
slips by his adversaries, sometimes he declined to admit that their samples were 
representative, sometimes he tried to ignore the effect of corrections which he 
was forced to make. 

These were the principal arguments. Both parties used the data of burials 
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and baptisms as subsidiary arguments. Price seems only to have used the London 
Bills, which rather let him down; because although they seemed to help for some 
part of the century, he admits that by 1773 London was increasing and, very 
characteristically, uses this as in his favour: ‘But it appears that, in truth, this 
is an event more to be dreaded than desired. The more London increases, the 
more the rest of the country must be deserted.’ Price’s adversaries went farther 
afield and counted burials and christenings in 162 parishes in all parts of England 
for two quinquennia, one beginning in 1758, theotherin 1773. Baptisms increased 
from 47,638 to 59,567, burials from 49,553 to 53,030. 

But neither party put much weight upon what we should now consider 
primary evidence; rightly, because of its incompleteness. 

But these data were not wholly neglected by medical writers as we shall see 
in later sections. One may fairly say on the evidence here summarized that the 
eighteenth-century political arithmeticians of England made no advance what¬ 
ever upon the position reached by Graunt, Petty and King. They were second- 
rate imitators of men of genius. 


(To be concluded) 
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1. Introductory 

When numerous organisms or organs are -weighed, the distribution of the weights 
is often positively skew. On the other hand, the distributions of linear measure¬ 
ments is often very close to normal. The question then arises, given that the 
volume of an organ is proportional to the product of three mutually perpen¬ 
dicular measurements, each being normally distributed, what will be the dis¬ 
tribution of the volumes 1 In general the three linear measurements will be 
correlated, and the problem might appear hopelessly complex. However, it will 
be shown later that, provided certain conditions are fulfilled, the distributions 
all lie very close together. 

The problem can obviously he generalized to cover the case of the product of 
any number of normal variates. The most interesting cases are those of two, 
three, or an infinite number of such variates, Further, two special cases are com¬ 
paratively simple. When the coefficients of correlation are all equal to unity and 
the coefficients of variation equal we are concerned with the distribution of a 
power of the normal variate. When the correlations all vanish, we are concerned 
with that of a product of several uncorrelated normal variates. 

It is not, of course, suggested that all skew variation of weights is to be 
explained on these lines. For example the Galton-Macalister distribution, in 
which the logarithm of the variate is normally distributed, can be thought of as 
arising in at least two different ways. The weight may be the product of a large 
number of normal variates; or for constant cell size, the number of cell generations 
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may be distributed in a certain manner about a mean in each organ, these numbers 
being normally distributed. The highly skew variation of human weights is 
probably to be explained by the fact that a rather small fraction of the human 
race lays down very large quantities of fat. Nevertheless, it will be shown that 
simple criteria, will determine whether observed positive skewness and lepto- 
kurtosis, too large to be ascribed to sampling error, can be explained on the lines 
discussed above. And in any particular case it is worth while finding out whether 
this is so. 

Anymeasure of asymmetry, such as y x = or of kurtosis, such as y 2 = As - 3, 

is a dimensionless number independent of the unit of measurement. Hence in a 
transformed normal distribution of the type here considered it must clearly be 
a function of the only dimensionless number derivable from the first two moments, 
namely, the coefficient of variation, c. In what follows we shall generally use m 
for the mean of the original normal distribution and km 2 for its variance, so that 
Ik- is the coefficient of variation. The usual notation is used for the mean, variance 
and other moments and cumulants of the derived distribution. 

The distribution of the product of a pair of correlated normal variates has 
already been fully discussed by Craig (1936). If m x and m 2 are the mean values 
of X and Y, \ and k 2 their coefficients of variation, and p their coefficients of 
correlation, then Craig finds for the cumulant generating function of 


XY 


vq 7/i ^ *J(k x /r 2 ) 


6 


/J_ 1 \ e | 2 

_ #iMJ -1 log [L - (1 +p) d] -1 log [1 + (1 - p) g]. 


K{0) ' ' 2[l-(l+p)0][l + (l~p)d] 

If k x = 1c 2 = k, this becomes 

6 


K (&) — 


The rth cumulant of 


*[l-(l+p)0] 
XY 


ilog [1 - (1 +p) 0]-*log [1 + (1 -P) 0}. 


«r = |r!^{(l+p)«- 1 -(p- 


V(A As) 


+ {(1 +P) n ~ (P~ ! K 1 +P> n + (P-W 

If k x = k 2 ~ k, the rth cumulant of AT is 

k t — (m x [(1 + pf- 1 r ! + p {( 1 + p) r + (p - 1 ) r } (r -1 )!]. 

Craig’s discussion is mainly confined to the cases where k x and & 2 are large. In 
the cases of greatest biometrical interest they are small, and some points of 
interest arise, besides those dealt with by Craig. 
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2. Distribution- or the oube of a normal variate 

If x be a reduced normal variate, that is to say, a variate whose mean is zero 
and standard deviation unity, it is required to find the distribution of X 3 , where 
X — m( 1 + /Ac). We note that 

0, and« 5F = (2r-l)(2r-3)(2r-5)...3.1 = 

Hence 

X 3 = m a ( 1 + 3**5; + 3kx 2 + k*x 3 ) — m 3 (l + 3k), 

X 6 — wi 8 ( 1 + (Wx +1 5kx 2 + 20**3* + 15* 2 3 4 + 6 *he 5 + * 3 »; 6 ) 

=f m 6 fl + 1 5k + 45* 2 + 15* 3 ), 
so that the first four moments of X 3 about zero are 


p\ = m 3 (l + 3k), 

Hz — m 4 ( 1 + 15* + 45fc a + 15* 3 ), 

H' 3 = m 9 (l + 36 k + 378/c 2 + 1260/c 3 + 945* 4 ), 

/*i« m 12 (l +66*+1486* 2 +13860* 3 + 51975* 4 +62370/c 5 + 10395/c fi ). 
Hence the moments about the mean m 3 (l + 3k), and the oumulants, are 


- m 3 (l + 3*), 

k 2 — /i 2 = 3m B *(3 + 12* + 5k 2 ), 


* 3 = = 54m 9 * 2 (3 +16* +15* 2 ), 

ft = 27m 12 * a (9 + 240*+ 1326* a +1920* 3 + 385/A), 
K i = 648m 12 * 3 (7 + 48* 4- 75/s 2 + 157c 3 ). 


( 1 ) 


It is to be noted that in these calculations, and in the majority, though not 
all, those of this paper, the expressions for the moments about the mean are 
simpler than those for the moments about zero. Successive moments about the 
mean are therefore best calculated from the former, so far as possible, rather 
than the latter. Thus the equation 


= / 4 -K(¥s + With + Pi), 

involves less algebra than the more usual 


Pi — Pi "f fi/q 2 / 4 ® 3/./j 4 . 

It follows that 

cr = 3m 3 ** (1 + 4* + |* 2 )*, 
(9fc+36* 2 + 15 * 3 )* 

C “ 1 + 3 * 

+ 6 f(3k) (3 +16* + 157c 2 ) 

71 (3+12* + 5* a )» 

+ 72*(7 + 48 * + 75* 2 + 15 * 3 ) 

72 “ (3 + I2k +5k 2 ) 2 



J. B. S. Haldane 229 

Hence the distribution is positively skew, and leptokurtic. If c is small, we 
have, approximately, 2^ 

yi = 2c +— 

56c 2 16c 4 

ra = “9~ + '2T + 

Since in most practical cases c 2 < OOl, the first term will have an error of less 
than 0-1 %, and no attention need be paid to the later terms. 



3. Distribution or any power or a normal variate 


It is required to find the moments of m n { 14- k*x) n . The mean value of its rth 
power is clearly 

( nr V- k , ("*•)»** , . i 1 

L 2(nr-.2)! 8(nr-4)! + ‘” 2».«!(w-2«)l^"'J‘ 

This converges for all positive values of n, since k is small, though it only 
terminates for positive integral values of nr. After somewhat tedious algebra, 
we find for the mean and other moments and cumulants: 

K t - = rn"-\l + \n(n- 1)&+...], 

K n — Aa = m in n 2 k[\ + \(n- 1) (3n—5) 

k 3 - n- A = 3m 3n w 3 (w- l)fc a [l + ^(17w 2 -55n + 44)ifc+...], . (3) 

ft i = 3m 4n ?i 4 fc s [l + |{w— l)(5n.—7)fc + ...], 

= 4«i 4 "ra 4 (w — 1) (in — 5) P[1 + 0(A;)], 

Hence c — + £(r - 1) (3»- 5)k + 

y 1 — 3(n— 1)H[1 + ^(7 n 2 - 38»+43) k+ ...], 
y 2 = 4(»-l)(4m-5)fc[l + 0(fc)]. 

And when c is small ,,, ,, , 

7l -fcl) c[1 + o(c»)], 1 

T ^4( M -l)( 4»- 5 K [I + 0(c , n | 

When n — 2, the expressions become very simple, 

— m 2 (l + k), 

/c 2 = 2 m*k(2 + k), 

k 3 = 8m«ifc 2 (3 + fc), ( 5 ) 


Hence 


Kf = (r — 1 )! OT 8r (2ifc)’‘- 1 (r + k).) 

This is in accordance with Craig’s formula, putting fc x — k 2 = k, p ~ 1. 
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4. Distribution or the product of three independent 

NORMAL VARIATES 


Let x, y, z, be independent reduced normal variates, so that x 2 = y 2 — z 2 — 1, 
x i = y i = z i — 3, etc. Let X= rn^l + ^x), Y=m i (l + k i y) > Z = m 3 (I + fc|z). 
Here k\, *|, k\ are coefficients of variation of linear measurements, so that *,, * 2 , * 3 , 
are commonly about 0 - 001 , It is required to find the moments of the distribution 
of XYZ. In the expansion of a power of X YZ we need only consider even powers 
of x, y and z. For example, 

X 2 Y 2 Z 2 = + k\x) 2 (1 + k\y) 2 (14- *|z) 2 

= m\m\m\(\ J rk 1 x 2 ){\. J r'k i y 2 ){\ J rk 3 z 2 ) 

= + k x ) (1 + * 2 ) (1 + fc 3 ). 

So if — V (a volume), the moments of XYZ about zero are 


= F 2 (l + *i) (1 + *2) (1 + * 3 ), 

/4 = 73(1 + 3^) (1 + 3*,) (1 + 3^), 

/** = F 4 (l + 6*3 + 3k 2 ) (1 + 6 * a + 3*|) (1 -I- 6 * 3 + 3*|). 

So k 2 = /i 2 = F 2 (F* 1 + 27*3*2+*3*2*3), 

X 3 = /4 3 = 6F 3 (27*3*2 + 4*3 * 2 * 3 ), 

/t 4 = 3V i (Eh\ + 227*3*2 + 627*|* 2 + 38*3*2*3+ 327*|*| 

+ 3627*1*2*3+ 1827*|* 2 *3 +9*f *1*§, 
k 4 = 6F 4 [227*f fe 2 +27* 2 * 2 + 4*3 * 2 * 3 (4 + 427*3+ 227*3*2 + *3*3*3)]. 

In practice we can neglect all terms except the leading one, and write 

«i = V, 

k 2 = F 2 (*3 + * 2 +* 3 ), 

K 3 = 6F 3 (*2* 3 + *3*2 + *3*2), 

Ar 4 = 12F 4 (* 2 *2 + *| *3 + *| *3 + *3 *2 + *2 *3 + *3 *f + 8*3 *2 *3). 

If *3 = * 2 = * 3 = *, we find 

k 2 = F 2 *(3 + 3* + * 2 ), 
k 3 = 6F 3 * 2 (3 + 4*), 
at 4 = 6F 4 * 3 (28 + 51* + 24* 2 + 4* 3 ),, 
which may be compared with equations ( 1 ). Thus 

c = ^(3*+3 * 2 + * 3 ), 


( 6 ) 


(7) 


56c a 22c 4 
7a_ ' 9 9 


■f . . .. 


(8) 
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It will be noticed that the leading terms of equations (2) and (8) are identical. 
Hence the two distributions will, in practice, be quite indistinguishable, even if 
tens of thousands of individuals are weighed. It follows that the cube root of the 
product of three independent variates is almost normally distributed. Further, 
the relations of y l and y 2 to c are little altered when k u k % and k s are different, 
provided that they are of the same order of magnitude. Thus if Aq = 0-001, 

11 16r 2 

k 2 = 0-002, & 3 sa 0-003, we find y, = —-o instead of 2c, and y 2 = —r— = 5-3c 2 

3 o 

instead of 6-2c 2 . Only a very large sample indeed would reveal departures of this 
order from the simpler expressions of equations (2) and (8). 


5. Distribution or the product op n independent normal variates 

Let x v x 2 , x 3 , ..., x r , ..., x n , be n independent reduced normal variates. The 
general case is of course somewhat complicated, so we shall only investigate the 
special case where all the coefficients of variation are equal. 

Then if X r — m r (l + k k x r ), and M = JJm r , the moments about zero of the 
distribution of the product TIX r are 

= M^i+kr, 
flit = M*(l + 3lc) n , 

/4 = Jf 4 (l + 67c+3& 2 )*, 

H' h = M 5 ( 1 + lOfc + 15& 2 )”, 

/4 = M*{ 1 +15jb + 45fc 2 +15fc s )". 

Hence 

/i 2 = iif 2 [(l + fc)»-l}, 

/t 3 = M 3 [(l + 3k) n — 3(1 + k) n + 2], 

^ = M\( 1 + 3k + 3**)» - 4(1 + 3 k) n +6(1 + k) n - 3], 

/i 6 = iif 5 [(l + 10fc + 15Pf - 5(1 + 6k + 3k 2 ) n + 10(1 + 3 k) n ~ 10(1 + &)»- 4], 

= M e [( 1 + 15fc+ 45& 2 + 15& 3 ) re — 6(1 + lOfe-f 1 6k 2 ) n + 15(1 + 6k+3k 2 ) n 

- 20(1 + 31c) n +15(1 + kf - 5], 

k 4 = ihT 4 [(l + 6 k + Zk 2 ) n —3{\ + 2 k+k 2 ) n - 4(1 + 3 k) n + 12(1 + k) n ~ 6], 

* 6 = W 5 {( 1 + lOJfe + 15F)« + 5[ - (1 + 6 lc+ 3k 2 ) n -2(1 + 4 k+ 3 fc 2 )» 

+ 6(1 + 2 k + k*) n + 4(1 + 3jb)» -12(1 + k) n ] + 24}, 

/c a = 1 +1 5k + 45 k 2 + 15 k 2 Y +15[2( 1 + 3& + 3fc 2 + k z ) n 

- (i + + 9 fc 2 + sk 3 ) n + 2(1 + 6k + 3& 2 ) w + 8( 1 + U + 3k 2 ) n 

- 18(1 + 2 fc + k 2 ) n - 8(1 + 3k) n + 24(1 + k) n — 8] 

- 6(1 + lOi + 15 k 2 ) n ~ 10(1 + 6k + 9P)»). 
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The higher cumulants can be evaluated as follows: 

*3 (V 1 - l)lf, 

k 4 = UP £ j (3 r_1 -1) [(2 + ky - 4] If, etc. 

' Thus we find 
= Jf, 

k 2 = M*nJc[l + —1) k + ■ ••]. 

k 3 = Jf %(ft - 1) /fc 2 [3 + 4(» - 2) * +... ], 

a r t = M^n— l)F[4(4ft — 5) + 3(13ft 2 — 49ft + 47) & + 

k 5 = 5M s w(w -1) fc 4 [(5n- 6) (5n-7) + 2(w - 2) (143n - 345) (ft +...], 

K a = 3lf 6 ft(ft-l)£ 6 [432ft 3 -1853w 2 + 2917ft- 1758 + 0(A)). 

These may be compared with equations (3). 

c = f(nk)[l + \(n-l)k +...], 
and when c is small ,, 

fO 

as in equations (4). 

When ft = 2 we have the simple forms 
/c 1 = JT, 

= JT 2 fc(2 + &), 

. /c, - eilfV, 
at 4 — dM i k 3 (4: + k), 


K ir = (2r -1)! M ir lc ir ~\2r + k), 

/r 2r+1 = (2r + l)! Jf 2r+1 A: 2r . 

This is in accordance with Craig’s formula, putting k t - k 2 = k, p - 0. 

6. Distribution of the product of two correlated normal variates 

In general the different linear dimensions of an organ or organism are posi¬ 
tively correlated. Organic correlations may reach very high values, such as 0-9, 
and presumably even higher values would be found for two approximately equal 
diameters of an approximate solid of revolution, such as an apple or an egg, where 
p may be taken as unity. On the other hand, quite low values are found. Thus the 
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length and breadth of a homogeneous group of like-sexed adult human sk ulls 
generally show a correlation of about 0-3 or 0-4. Hence the area of an organ will 
commonly be proportional to the product of two correlated variables, the 
volume to the product of three. 

Let x and y be two reduced normal variates, with correlation p. The cumulant- 
generatlng function for their joint distribution is l(t i +2ptu + u i ). That is to say 

- # -_ fT^8 

if r + 8 is odd, then x r y e = 0. If r + s is even, then x T y s is the coefficient of ~r—r in 

Hs! 


exp i(t 2 + 2ptu + u 2 ). That is to say, a fy s is the coefficient of t'u* in 




{t 2 + 2ptu + % 2 ) i( - r+a) , 


multiphed by 


r\s\ 


It follows that 


2^+a)[J( r + s )]r 


x 2 = 1, 

xy = p, 



x l ~ 3, 

x a y = 3 p, 

xY — 1 + 2p 2 , 


%{ 

II 

Ol 

x*y = 15p, 

xY = 3(1 + 4p 2 ). 

xY = 3p(3 + 2p 2 ), 

x 9 = 105, 

x 1 y = I05p, 

xY — 16(1 + 6p 2 ), 

xY — 15p(3 + 4p 2 ) 


xY = 3(3 + 24p 2 + 8p 4 ), 


xY = 105(1 + Sp 2 ), x"'y 2 — 315/>(1+ 2p 2 ), xY — 45(1 + 12p 2 + 8p 4 ), 

xY - 315(1 + 16p 2 + 16p 4 ). 

Of course x?y r — afy s . 

Thus the moments of xy about its mean p are 

IH = 1 +p\ Pz = 2p( 3+p 2 ), p x = 3(3 + 14p 2 + 3p 4 ). 

Hence /c 4 = 6(1 + 8p 2 +p 4 ). Hence the distribution is leptokurtic, and asym¬ 
metrical unless p vanishes. 

Now consider two correlated normal variates 

X = m x (l + ax), and Y = t«. 2 (l + by), 

where a and b are coefficients of variation. Let = A. Then the moments of 
X Y about zero are 

X = A{\+pab), 

X = -4 2 [1 + a 2 + 4pa6 + 6 a + (1 + 2p 2 ) a 2 6 2 ], 

X = ^ 3 [1 + 3(a 2 + 3pab + b 2 ) + 9ab{pa 2 + (I + 2p 2 ) ab + pb 2 } + 3p(3 + 2p 2 ) a*b% 

'X - d 4 [l + 2(3a 2 + 8 pab + 3b 2 ) + 3pa 4 +16 pa*b +12(1 + 2p 2 ) aW+ 16pah® + 3 pb* 
+ 6« 2 fe 2 (3(l + 4p 2 ) a 2 +8p(3 + 2p 2 } ah+ 3(1 + 4p 2 ) b 2 } + 3(3 + 24p 2 + 8p 4 ) a 4 h 4 ]. 
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Hence 

k 2 = /6 a = .A 2 [a 2 +2pa6 + 6 2 +(l+p 2 )a, 2 6 2 ], 

k 3 = fi s = 2A !i ab[B{pa 2 + (l+p 2 )ab+pb i }+p(B+p 2 ')a 2 b% 

p 4 = 3H 3 [(a a +2p c j6 + 6 2 ) 2 + 2a 2 & 2 {(3 + 7p2)a 2 + 2p(7 + 3p 2 )a6 

+ (3 + 7p 2 ) 6 2 } + (3 +14p 2 + 3p 4 ) a 4 * 4 ], 

x 4 - 64W[2{(l + 3p 2 )a 2 + 2p(3 + p 2 )a?> + (H-3p a )6 a } 

+ (l + 6p 2 + p 4 )a 2 6 2 ], (10) 

in accordance with Craig’s formula. 

The most interesting case arises when a — b ~ feh This case is important 
because in practice the coefficients of variation of different linear dimensions of 
the same organ are often nearly equal. Thus those of linear skull measurements 
in like-sexed adults in a racially homogeneous population are about 0-03, so that 
k is about 0-001. The moments and cuinulants of XY are then 


k 1 = A(l+pJc), 

k 2 = H = A 2 k[2{l + p) + (l + p 2 ) k], 

* 3 = P 3 = 2.4 3 & 2 [3(l+p) 2 + (3 + p 2 )fc], 

p 4 = 3H 4 fc 2 [4(l + p) 2 + 4(3 + 7p + 7p 2 + 3p 3 ) k + (3 + Up 2 + 3p 4 ) fc 2 ], 

* 4 - 6Pl 4 P[4(l+p) 3 +(l + 6p 2 +p 4 )£]. (11) 

Hence 


:-vt2(n-rtt][i + i . 4 y + >h +..,; 


while 


Vi — l°[l + 0(c 2 )], 

7 2 — 3c 2 [l + 0(c 2 )], 

exactly as found when the two variates are in a constant ratio or quite indepen¬ 
dent, When however a =)= b, this is no longer the case, for it is clear that the distribu- 

tion becomes normal when a or b vanishes. If-—- , -- = p, so that p > 1, we have, 


for equations (10), 
7i = 


1 + 2pp + p 2 3c 
~ iP + P) 2 


7z = 


2a6 

1 + 3 pp - 3p 2 +pp 8 
( P+P? 


3c 2 , 


both approximately. However, a and b may differ considerably without any very 
great effect of yj/c or y 3 /c. Thus if a - 26, so that p = 

8(2+hp + 2p 2 )3c 
7l ~ (5 + 4p) 2 2' 


That is to sfiy, y x is 64 % of 3c/2 if p = 0, and 86 % if p = whilst y 2 is 51 % of 
3c if p = 0, and 71 % if p = £. 

So far we have assumed that p is not negative, If p = - 1 , X Y ~ 1 - kx 2 , 
so the mean is ^4(1 - k), and the other eumulants are given by 


K r = (-ya^r-l)!^. 
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If p = — 1 + 2+ +k s in equations (11) vanishes, though the distribution 

does not become quite symmetrical. However, negative values of p are of no 
biological interest. 

7. Distribution or the product of three correlated normal variates 

Let x, y, z, be three reduced normal variates as before, their correlations being 
p vz — k, p zx = A, p xv = y. Thus the cumulant-generating function is 

i(£ 2 + % 2 + v 2 + 2 kuv + 2A vt + 2ytu ). 

Odd moments vanish, and the even moment x p y q z r is the coefficient of t^u^v' in 
(t 2 + u 2 + v 2 + 2kuv + 2A vt + 2ytu)^ p+a+r \ 

multiplied by + ' 

The required moments of products of two variates, suoh as 
xy = y, x^y 2 = 3(1 + 4/i 2 ), 

have already been given. The required moments of products of all three are: 
x 2 yz = * + 2Ay, 

x^yz - 3(* + 4A//,), xSjH ~ 3(A + 2/c/i. + 2* J M 2 ), x 2 y 2 z 2 = 1 + 227* 2 +8*A,m, 

xAyh = 3(3* + 12A/4 + 12*/t 2 + 8A/i 3 ), 

xhfz 2 = 3(1 + 2k 2 + 4A 2 + 4/t 2 + l6*A/t + 8A 2 y 2 ), 

x 2 yh 2 = fifty + 6*A + 2p s + 6 K 2 y + 6\ 2 y + 12 k Ay 2 ), 

xiyi z !i — 3(3k q. 12/c 2 + 12A 2 + 24/i 2 + 96k Ay + 48* 2 /t 2 + 48A 2 /i 2 + 8/t 4 + 64/cA/f- 3 ), 

x 2 yH 2 = 9(3*+ 12A/t+ 2* 3 + 12*A 2 + 12*/t 2 + 24* 2 A/t + 8A 3 /t 

+ 8A 3 /i + 8A/t 3 + 24*A 2 /i 2 ), 

xYz* = 9[3 + 242k 2 + 827* 4 + 9627AV 2 + 64<Ayft + 22k 2 + 3*A/i)] . 

Other moments (except those such as xy 2 z — A + 2*/t, which are derivable by 
transposition) are not needed for our purposes. It follows that xyz has a sym¬ 
metrical and leptokurtio distribution, with mean zero, and 
* 2 = y- 2 = 1 + 22k 2 + 8 kA y, 

* 4 = 12[2 + 1727* 2 + 527k 4 + 102A 2 y 2 + 4*A/i(35 + 222k 2 + 32*A/i)]. 

If X - m x {l+ax), Y = m^l + by), Z = m 3 {l+cz) the expressions for the 
higher moments are very complicated, though it can easily be shown that the 
mean is m 1 m 2 w 3 (l + *6c + Aca+ yab), and the variance 

m 2 m 2 m 2 [2a 2 + 22 K bc + 2( 1 + * 2 ) 6 2 c 2 + 227(2* + 3A y) a 2 bc 

+ (1 + 227* 2 + 8*A/i) a 2 b 2 c 2 ]. 

16 
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We shall only give further consideration to the case where all the coefficients 
of variation are equal. And we shall confine ourselves to two special cases of this. 
In the one k = A = p — p, that is to say the variates are equally, and therefore of 
course positively, correlated. This is not very far from the case with the human 
skull. In the other, k = 1, whilst A = p = p. This is appropriate to an approximate 
solid of revolution, such as many eggs and fruits, or to a regular prism such as 
some sponge spicules. 

In the first case we have for the moments of products of three variables: 
xh/z = p(l + 2p), 

x^yz = 3p( 1 + ip), x s y 2 z = 3p( 1 + 2p + 2p 2 ), x 2 y 2 z 2 = 1 + 6p 2 + 8p 3 , 
a; 4 y 3 z = 3p(3 + 12p + 12p 2 + 8p a ), zb/V ~ 3(1 + 10p 2 + 16p 3 + 8p 4 ), 
x 3 y 3 z z = 3p(3 + 6p + 14p 2 + 12p 3 ), 
xSfb 2 = 3(3 + 48p 2 + 96p 3 + 104p 4 + 64p B ), 
xhfz 3 = 9p(3 + 12p + 26p 2 + 40p 3 + 24p 4 ), 

X^z* = 27(1-1- 24p 2 + 64p 3 + 104p 4 + 128p 6 + 64p 6 ). 

So if X = mfl + kix), Y= m 2 (l + Fp), Z = wi s (l + Fz), m 1 m i m 3 — V, 
the volume which is the product of the means, then the moments of XYZ about 
zero are 

= F(l + 3pi), 

J*t - F 2 [l + 3(l + 4p)fc + 3(l + 4p+10p 2 )&+(l + 6p 2 +8p 3 )F], 

Pa = F 3 [l + 9(l+3p)A: + 27(l + 5p + 8p 2 )F + 9(3 + 21p + 54p 2 

+ 62p 3 ) F + 27p(3 + 6p + 14p 2 + 12p 3 ) F], 

pi = F 4 [l + 6(3 + 8 p)k + 9(13 + 64p + 88p 2 ) F + 36(9 + 64p + 160p 2 

+ 152p 3 ) F + 27(13 + 128p + 448p 2 + 768p 3 + 568p 4 ) F 
+ 54(3 + 24p + 144p 2 + 304p 3 + 424p 4 + 256p 6 ) F 
+ 27(1 + 24p 2 + 64p 3 + 104p 4 + 128p B + 64p e ) 7c®]. 

Hence 

K i e =/4 = F(1 + 3pfc), 

= Pi = F a fc[3(l + 2p) + 3(1 + 4p + 7p 2 ) k + (1 + 6p 2 + 8p 3 ) F], 
x 3 = p 3 = 6F 3 F[3(1 + 2p) 2 + (41 + 27p+ 60p 2 + 53p 3 ) k 

+ 3p(4 + 9p + 8p 2 +14p 3 )F], 

p 4 = 3F 4 F[9(1 + 2p) 2 + 2(37 + 222p + 471p 2 + 350p 3 ) Ic + 3(39 + 316p 
+ I038p 2 + 1684p 3 + 100 lp 4 ) k 2 + 8(3 + 24p + 127p 2 + 268p 3 
+ 346p 4 +192p 5 ) F + 9(1 + 24p 2 + 64p 3 + 104p 4 +128p 5 + 64 p») F], 

Ki = 6F 4 F[28(1 + 2p) 3 + 3(17 + 144p + 468p 2 + 688p 3 + 411p 4 ) k 

+12(2+ 17p + 92p a + 193p 3 + 241p 4 + 130p 5 ) F 
+ 2(2 + 51p 2 + 140p 3 + 225p 4 +264p 5 + 128p 6 ) F]. (12) 
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These expressions reduce to equations (1) if p - 1, and (7) if p = 0. When k 
is small, c 2 = 3(1+ 2p) 1c, ^ = ^ + 

KA /.2 

r 2 =-~[i+o(c 2 )]. 

Thus the relation of y 1 and y 2 to c is almost independent of the coefficient of 
correlation. 

We next consider the case when z = y, so that k = 1, whilst A ~ ji — p. This 
is most simply solved by finding the moments of (1 + Ux) (l + Fy) 2 . Thus if 
X = m x ( 1 + fc-x), and T = m 2 (l + Fp), the moments of X F 2 about zero are 

/4 = F[l + (l+2p)fc], 

M» = F 2 [l + (7 + 8p)fc+3(3 + 8p + 4p 2 )F+3(l + 4p 2 )F], 

K = F 3 [l + 18(l + p)fc + 18(5+ llp + 5p 2 )& 2 + 30(5 + 15p + 18p 2 + 4p 3 )/c 3 

+ 45(l + 6p + 6p 2 +8p 3 )F], 

/4 = F 4 [l+ 2( 17 + 16p)fc+ 3(127+ 256p + 112p 2 )fc 2 + 84(21+ 64p + 64p 2 
+ 16p 3 ) F + 105(31 + 128p + 192p 2 + 128p 3 + 16p 2 ) F 
+ 630(3 + 16p + 32p 2 + 32 p 3 + 16p 4 ) F +315(1 + 16p 2 +16p 4 ) F]. 

Hence 

k= F[l + (l + 2p)/fc], 

k 2 = fi 2 - F 2 fc[6 + 4p + 4(2 + 5p + 2p 2 ) k+ 3(1 + 4p 2 )fc 2 ], 

k 3 = p 3 = 2F 3 F[3(8 + 14p + 5p 2 ) + 2(29 + 84p + 87p 2 + 16p 3 ) * 

+ 9(2+ 14p + 13p 2 + 16p 3 ) *>■], 

p 4 = 3F 4 & 2 [(5 + 4p) 2 + 8(40 + 113p + 95p 2 + 22p 3 ) k + 2(427 
+ 1628p + 2376p 2 +1376p 2 + 160p 3 +160p 4 )F 
+ 24(24+ 121p + 237p 2 + 234p 3 + 104p 4 ) F+105(1 + 16p 2 + 16p 4 ) F], 

x 4 = 24F 4 F[30 + 80p + 65p 2 + 14p 3 + (95 + 364p + 513p 2 + 292p 3 

+ 32p 4 ) & + 3(22 + 116p + 227p 2 + 2l4p 3 + 96p 4 ) k 2 
+ 3(4+67p 3 +64p 4 )F]. (13) 

Hence, approximately, 

c 2 = (5 + 4p) k, 

_ 6(2 + p)(4 + 5p)g 

71 (5 + 4p) 2 

24(30 + 80p T +65p 2 + 14p 3 ) c 2 

72 ~ (5 + 4p) 3 

Hence y 1 varies between ffc, or l-92o when p = 0, and 2c when p = 1. This is 
its maximum, so for high values of p, yjc is nearly constant. It vanishes when 
p = _o-8. y 2 increases from ^c 2 , or 5-76c 2 , when p = 0, to -^c 2 , or 6-2c 2 when 

16-2 
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p = 1. This again is its maximum, so y 2 /c 2 is nearly constant for large p. y z does 
not vanish for any admissible values of p. In fact for positive values of p it would 
be impossible except with enormous samples, to distinguish this distribution 
from that of the cube of a normal variate. If p = 1, equations (13) reduce to 
equations (1). If p ~ 0, they give the eumulants of XY 2 , where X and Y are 
uncorrelated normal variates with the same coefficient of variation, namely, 

K t ~ V{l-\~lc), k 2 = F%(5 + 8* + 3& 3 ), 

= 4F 2 /fc 2 (12 + 29 h + 9 k% = 24F 4 /fc 3 (30 + 95* + 66 W + 12 1c 3 ), 

which may easily be obtained independently. 


8. The product op n correlated normal variates 


If the variates are X t , X 2 , X r , ..., X n , where X r = m r (l +a r x r ), and 
P — TIm T , while x r is a reduced normal variate, and p rs is the coefficient of corre¬ 
lation of x r and x g , and hence of X r and X s , the general expression for the moments 
of X v X z , ..., X n is complicated. It can however easily be seen that the mean is: 

P[l + Xp rs a r a s + X[p ra p lu + p H p su +p n p£ a T c W i u +...], 
while p % = P 2 [2a 2 4- 2 Xp rs a r a s +...]. 


If every a r ~ and every p r8 — p, then xf l x%*x$’... vanishes when m = Soc r 
is odd, and when m is even it is the coefficient of tptp... in the expansion of 
(Xt 2 + ‘2pXt r t s ) m , multiplied by a 1 \a 2 la 3 \/(2 m rnl). Thus the moments about 
zero are: 




So 


l + \n{n-l)pk+± s n{n-l){n~2) {n - 3)p 2 fc a +... + 2 r rT(n- 2 r )! + '''] ’ 

Pa - P 2 [l+a(l + 2(n-l)p}fc + ^n(w — 1) 

x {1 + i(n— 2) p 4- 2(2n a — 6n+ 5 )p 2 }& 2 + ...], 
Ps = JP 3 [1 + 3n{l +1(« - lp} fc + %n(n - 1) 

x {1 + (37i— 4)p + ^(9w 2 — 21n + 14) p 2 }& 2 + ...], 


Pa = P 2 nk[ 1 + (n~ 1) p + \(n- 1) {1 + 4(w, - 2)p + (3 n 2 - 9w + 7) p 2 } k + .. 
p 3 = ZP 2 n{n-l)[l + (n~l)pVc 2 [l + 0{k)l (14) 

Thus c = [«{1 + (n -1) p} *]* [1 + 0 (ft)], 

T + (tc-1)p1* 


Vi 


= 3(.-l)[l±i^J 


[1 + 0 (&)] 


( 16 ) 


n 


c 2 [l + 0(c% 


and presumably y 2 etc. are approximately the same functions of c as in the case 
of equations (9). 
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9. The Qalton-Macalister distribution 
C onsider the distribution of the nth power of a normal variate when n ia very 
large, and k very small, but nkS remains constant. All but a few terms of equa¬ 
tions (3) will vanish, and if nk i ia small, the distribution becomes identical with 
the distribution described by MacAlister (1879) whose first four moments have 
been given by Pearson (1905). Its cumulants can readily be found as follows: 
Given that x = log X is normally distributed, to find the distribution of X. Let 
in and s be the mean and standard deviation of x. Then X = e x . Hence, since the 
moment-generating function of x is M(t) = e ml+i * l \ the rth moment of X about 

zero is , — r a„ 

H,. = e rx = 1 + r/t 1 + ~p+ ... = M(r) = e mr +l ar \ 

where /q, /q, etc. are the moments of x about zero. Let M = e' ,n+ ' ia , l — e s . Then 
= M r l ir{r ~ 1) . Hence the cumulants of X are 

= M, 

k 2 = 

at 8 = Jf 3 (Z-l) 2 (Z+2), 

k 4 = M\l - l) 3 (Z 3 + 3Z 2 + 6Z + 6), 

k b = M fi (Z - 1 ) 4 (Z 6 + 4Z B + 1OZ 4 + 20Z 3 + 30Z 2 + 36Z + 24), 

k 6 — M 6 (l — 1) B (Z 10 + 5Z 9 +1 51 s + 35 P + 70 Z 3 +120Z S + 180Z 4 

+ 240Z 3 + 270Z 2 -f 240Z +120), (16) 

or, if Z — 1 is a small quantity, q, approximating to s, 

K x — M, 
x 2 = 

k 3 = M\\3 + q), 

k 4 = JlfV(16+15g+6g 2 + g 3 ), 

k 5 = df 5 g 4 (125-t-222g , -(-205g a + 120g ,3 -)-45g 4 + 10g s +-g 6 ), 
k 6 = ibrY( 1296 + 3660 3+ 5700 ? 2 + 5165 2 ,3 + 4946 ^ 4 + 299 V 

+ 1365g® + 465g 7 + 105g 8 + 15g 9 + g 10 ). 

Hence c 2 = q, 

y 1 = 3c 4- C 3 , 
y 2 = 16c 2 + 15c 4 +..., 
y 3 = 125c 3 + 222c 6 + ..., 
y 4 = 1296c 4 4- 3660c 9 + .... 

The first terms represent the limiting values of equations (3) and (9). The 
leading term of y r up to y 4 at least, is (r + 2) r . Thus for a given coefficient of 
variation, y x is 50 % greater than in the ease of the cubed normal variate, y 2 is 
157 % greater. 
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10. Biological applications 


Wilson and Hilferty (1931) showed that the cube root of y 2 is almost normally 
distributed when n exceeds 2. Haldane (1938) gave the value of the cumulants 
in this case, and showed that, for large values of n, (1 By 2 — n) ‘ 3 is even more nearly 
normally distributed. It is interesting to compare the cumulants of the cubed- 
normal distribution [equations (2)] with those of the y 2 distribution. The latter 

12 

are /c x = n, x 2 = 2 n, k s = Sn, /c 4 = 48 n, etc., so that c = y x = 2c, y 2 = 6c 2 . 

M 7b 

Thus if we compare the y 2 distribution with a cubed-normal distribution of the 
same coefficient of variation, we find that they are equally asymmetrical, but 
that the y 2 distribution has a y 2 which is f|-th that of the cubed normal. Hence 
the same transformation will nearly abolish both k 3 and x 4 for both distributions. 

The success of Wilson and Hilferty’s transformation suggests strongly that 
we may use equations (1), (6) or (3) for the approximate normalization of moder¬ 
ately skew variates. This is an urgent problem in several applications of statistics 
to biology (Haldane, 1939). We evaluate a number whose mean and standard 
deviation in the case of random sampling are known. We desire to know whether 
it differs significantly from the mean. But the sampling distribution is found to 
be skew. If we can approximately normalize it, our tests of significance become 
far sharper. This problem is taken up in detail elsewhere. If y 2 > fa and both 
are small, so that k 2 can he neglected, we can find m, n and k so that {m + k i x) n 
has a given y 1 and y 2 . By equations (3) and (4), 


n = 1 + 


16yf — 9y a ’ 




y\ 

9[n~ l) 2 ’ 



If, however, y x is not small, it is necessary to take several terms of equations (3), 
If in an observed distribution of weights or volumes, the estimate of y x /c, 
or of Kj/Cg/x! is approximately 2, it will be reasonable to try whether y 2 /c 2 or 
k\kJkI approximates to and if so to try to fit a normal distribution to the 
cube roots of the variate. It will be seen that this does not imply that all the 
objects considered are of the same shape. On the-contrary, such a distribution 
is to be expected whenever three mutually perpendicular measurements are 
normally distributed, provided that their coefficients of variation and correlation 
are not very different. And even the greatest differences in the latter, provided 
they are not negative, will have little effect. 

Unfortunately, reliable estimates of y 1 and y 2 can only be obtained from 
samples of the order of 1000 or more. Rendel (in a paper to be published shortly) 
obtained the following estimates for the cumulants of the weight distribution of 
1202 viable duck eggs, corrected for grouping. The unit is a gram: 

= 73*294, & 2 = 42-69, k 3 = + 118-668, jfc 4 = +937-21. 
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Hence the estimates of c, y i and y 2 are 

c = 0-0887±0-0018, ft =+0-432 +0-071, ft = +0-524 + 0-142. 

The standard errors are those appropriate to a normal distribution, but the true 
values cannot be very different. It is clear that ft and ft are significantly greater 
than the values of 0-27 and 0-13 which we should expect were the logarithms 
of weights normally distributed. They differ still more from what would be ex¬ 
pected were the cube roots of weights normally distributed, or on any other 
hypothesis leading to a similar distribution. 

Pearl (1905) lists the moments of the distributions of brain weights of eight 
European populations of 197 to 529 individuals. The various estimates of c are 
all close to 0-080, ranging from 0-074 to 0-083. Those of Ji range from + 0-11 to 
+0-40, those of y 2 from -0-30 to +1-5. The weighted means are 

c = 0-07965 + 0-00106, ft =+0-2306 + 0-0461, ft = +0-2661 + 0-0922. 

If the cube roots are normally distributed, we should expect ft = + 0-16, 
ft = +0-037; if their logarithms are normally distributed, we should expeot 
ft s= +0-24, ft = +0-10. The latter distribution gives the better fit, but the first 
is not impossible. 

Sinnott (1937) gives a graph of the distribution of the weights of squash fruits 
in an i\, which is positively skew. He shows that a graph of the distribution of 
their logarithms, though negatively skew, is more symmetrical. There is a sug¬ 
gestion that a graph of the distribution of their cube roots would be even more so. 
Unfortunately the actual figures are not given, and since curve-fitting by eye is 
notoriously uncertain, no more can be said. It is much to be desired that, when 
the full data are not given, estimates of the first four moments or cumulants 
should be published. 


11. Summary 

The first four moments of a number of powers and products of normal variates 
are calculated, with special reference to the probable distributions of weights, 
volumes, or areas of organs and organisms. In each case the first two measures 
(y x and y 2 or ^ and /? 2 ) of deviation from normality are obtained in terms of the 
coefficients of variation. The expressions obtained are almost independent of the 
correlation between the linear measurements, provided the coefficient of varia¬ 
tion of the latter are approximately equal. The distribution found is perhaps 
applicable to data on brain weights, but not to data on ducks’ eggs. 
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THE TRANSFORMATION OF DATA FROM ENTOMOLOGICAL 
FIELD EXPERIMENTS SO THAT THE ANALYSIS 
OF VARIANCE BECOMES APPLICABLE! 

By GEOFFREY BEALL 

Dominion Entomological Laboratory , Chatham, Ontario , Canada 
1. Introductory 

The present paper deals with experiments on the control of insects in the field. 
In such experimental work the problem to be investigated is whether more insects 
survive on plots which have been subjected to one treatment than on plots 
subjected to another. It will be shown in the present paper that the numbers of 
insects found per plot must vary in such a way that one cannot, strictly, subject 
the results to the analysis of variance, and it is proposed to find how the data 
may be transformed so that analysis of variance becomes applicable. Such 
transformation has been discussed by Bartlett (1936 a, b) in connexion with 
entomological experiments, and by Tippett (1934) in connexion with industrial 
experiments. 

2. Experimental results considered 

The data used in the following work are results from seven insecticidal 
experiments arranged by the author at Chatham, Ontario. The work was carried 
out with replicated blocks containing plots subjected to treatments of which the 
assignment was random. This procedure, normal in agronomic work, was supple¬ 
mented by one repetition of each treatment within a block. The assignment of 
the repetition of a treatment was independent of the first for that treatment, 
except that, of course, the same plot could not be chosen twice. This repetition 
was carried out to obtain estimates of variability within blocks. In these experi¬ 
ments complete counts were not made but random sampling was employed. 
Experiments on Pyrausta nubilalis Hubn., reported by Beall et al. (1939), for 
which results are shown in Tables 1 and 2, were made on one area at two different 
periods, whereas experiments on Leptinotarsa decemlineata Say, for which results 
are indicated in Tables 3 and 4, were carried out on contiguous areas at the same 
time. Three similar experiments were carried out in one place on the tobacco 
hornworm, Phlegethontius quinqu&maculata Haw., for which the data are shown 
in Tables 5-7. Reference is also made to the data from a uniformity trial on 
insects of Beall (1939). 

t Publication No. 2101, Division of Entomology, Science Service, Department of Agriculture, 
Ottawa, Canada. 
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Table 1. Numbers of an insect, Pyrausfca mibilalis, per plot. Experiment I 



Table 2. Numbers of an insect, Pyrausta nubilalis, per plot. Experiment IT 
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Table 3. Numbers of an insect, Leptinotarsa decemlineata, 
per plot. Experiment III 



Table 4, Numbers of an insect, Leptinotarsa decemlineata, 
per plot. Experiment IV 



Table 5. Numbers of an insect, Phlegethontins quinquomaculata, 
per plot. Experiment V 
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Table 6. Numbers of an insect , Phlegethontius quinquemaculata, 
per plot. Experiment VI 


Treatment 

Block 

1 

2 

3 

4 

6 

6 

1 

12 

13 


4 

11 

4 

1 

13 

9 


7 

5 

10 

2 

13 

6 

8 

1 

5 

7 

2 

20 


0 

4 

12 

7 
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Table 7. Numbers of an insect, Phlegethontius quinquemaculata, 
per plot. Experiment VII 


Treatment 

Blook 

1 

2 

3 

4 

6 

6 

1 

10 

20 

14 

10 

17 

14 

1 

7 

14 

12 

23 

20 

13 

2 

11 

21 

16 

17 

19 
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2 

17 

11 

14 

17 

21 

13 
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7 

3 

2 

3 

1 

3 

l 

2 

1 

1 
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4 

3 

12 

4 

5 

5 

2 

4 

5 

6 

3 

6 

5 

4 

5 

3 

3 

3 

1 

3 

6 

5 

5 

S 

6 
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2 

4 

6 

11 

16 

15 

13 

26 

24 

6 
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22 

16 

10 

26 

13 


3. The relationship between the standard deviation and the mean 
in the experimental data 

If * is the number of insects on one of a group of small contiguous areas, say 
plots, within a larger area, say a block, let the expectation of x over all these 
plots be M and the standard deviation he tr; then over a number of the larger 
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areas, when the insects are distributed in a completely random fashion, from the 
Poisson distribution, cr 2 = M. ( 1 ) 

As is discussed by ‘Student’ (1919) one cannot, however, anticipate that (1) will 
be satisfied when organisms occur in groups, as, say, when insects come from 
masses of eggs, or when there is a change in expectation from plot to plot within 
a block. Generally, cr 2 will tend to be greater than M and we can only say 

<r*=f(M). (2) 

The form of/(if), in (2), must be considered carefully, since it bears on the form 
of the transformation which may be developed to make the standard deviation 
independent of the mean. 

In dealing with (2),Bartlett (1936 a) started by supposing that, approximately, 

cr 2 = KM, (3) 

where K is a constant. Generally, in field data, however, the relationship between 
cr 3 and if, or of their respective estimates, s 2 and x, does not, as in Pig. 1, appear 
to be linear; rather, the departure of s 2 from x becomes disproportionately great 
as x increases. This relationship between departures and the magnitude of the 
mean has been discussed by Clapham (1936) in connexion with data on the 
distribution of organisms differing from insects as muoh as flowering plants, and 
he showed that only those distributions with very low mean have the squared 
standard deviation close to the mean. 

Our discussion above on the shortcomings of (3) suggests the conclusion that 

cr 2 — MozM (4) 

is generally untrue. We propose to consider the possibility that the curvilinearity 
of (2) might be better met by supposing that 

cr 2 —Mcc M 2 . (5) 

Equation (5) leads to cr 2 = M + kM 2 , (6) 

where k is a constant. It will be noticed that 

k — {cr 2 — M ) M~ 2 (7) 

is the Charlier coefficient of disturbance from a Poisson distribution. This 
coefficient was employed by Beall (1935). 

It is possible to consider the suitability of (3), as compared with (6), by finding 
how, respectively, they fit observations on s 2 and x. To fit exactly is difficult, and 
it was found necessary to fall back on an empirical determination of K and of fc; 
thus, if there are a number of pairs of estimates, * and s 2 , from (3) and (6) we 
estimate X = Es 2 fEx, (8) 

k = { Es 2 - Ex )! Ex 2 , (9) 

where E represents the summation over all pairs. 
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Since in the work presented in § 2, x and s 2 , being based on only two obser¬ 
vations, are highly variable, these experiments do not show clearly the suitability 
of (3) and (6). Accordingly, reference is made instead to the data from the uni¬ 
formity trial on Leptinotarsa decemlineata Say of Beall (1939). When the mean 
and standard deviation of 144 sampling units within each of 16 areas were con¬ 
sidered, the estimates from (8) and (9) were K = 2-405 and k = 0-2548. Bor 
these values from (1), (3) and (6), curves, described as lines 1, 2 and 3 respectively, 
are plotted in Fig. 1, the observed values of mean and squared standard deviation 



Pig. 1. The squared standard deviation plotted against the mean for 144 small areas within each 
of 16 large areas; line 1 is from equation (1), line 2 from (3) and line 3 from (6). The counts had 
been made on Leptinotarsa decemlineata Say. 

are also shown. In the cases where the mean is near unity the departure of the 
squared standard deviation from the mean, i.e. from line 1, appears to be trivial, 
but as the mean increases the departure becomes more marked. It can be seen 
that the observations lie more snugly about line 3 from (6) than about line 2 
from (3). Generally, for the data from field studies the same effect has been 
observed. Such results suggest that (6) may be generally a better approximation 
to the form of /(if) than (3) and make it preferable to proceed with the analysis 
of data from the assumption (6). 

4. The transformations of field data 

Fig. 1 shows clearly how, within an area, the variability of the numbers of 
insects on sub-areas is related to the mean number of insects per sub-area. This 
relationship will make invalid the use of the analysis of variance on experimental 
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results involving counts on insects, since the expectation of the variance 
should be the same for all plots. To overcome this invalidity, Bartlett (1936a) 
suggested transforming the observations, x, from the basis of (3). The transforma¬ 
tion found was x 4 , which Bartlett modified to (x + |) 4 . From §3 it was seen, 
however, that for field data the relationship between standard deviation and 
mean may be represented better by equation (6) than by (3), and, since the form 
of the transformation depends on the form of f(M), a fresh transformation must 
be sought. A transformation, as is developed in the Appendix to the present paper, 
is suggested by the method of Tippett (1934), i.e. 

x' - sinh- 1 (fcr)h (10) 

An advance note of this transformation was published by Beall (1940). The 
adequacy of this transformation must be judged from the extent to which it 
stabilizes variability. In (10), if we express sinh -1 (hx) 4 , when lex < 1, as a well- 
known series, we have 

x' - x i ~%kx i + £ 5 k i x i -T^k s xt + (11) 

where it is obvious that for h = 0, x' = x i . Of course, for large values of lex, 
x[ varies almost as log x, or as the log (x +1) used by Williams (1937), and so our 
proposed expansion may be regarded, for practical purposes, as embracing the 
root and logarithmic transformations. 

Table 8 gives the transformation, (10), for a probable range of observations, 
x, and for k at intervals which will probably be close enough for practical purposes. 
This table was computed in part by inverse interpolation from the table of hyper¬ 
bolic functions of the Smithsonian Mathematical Tables (Beoker & Van Orstrand, 
1931), and in part from (12). Should values of x' be required outside those of 
Table 8, these can conveniently be calculated from 

x' = & -4 log„{(&a;) 4 + (1 + fce) 4 }. (12) 

In preparing Table 8 the question arose of whether, instead of dealing with 
k -4 sinh -1 (fex) 4 , one should not use k -4 sinh" 1 (k 4 (x + -|) 4 } in the same way as 
Bartlett (1936a) dealt with the transformation, (x + 1) 4 , instead of x l . This 
modification was rejected on the basis of results of the transformation, as dis¬ 
cussed in § 5, since it was found that the addition of made little difference and 
did not give, consistently, an improvement. 

For field data, in making the transformation (10), it is necessary to estimate 
the value of 1c empirically by (9) for which estimates x, of the mean and s, of the 
standard deviation, must be found. The most obvious method in practice of making 
these estimates seems to be to put more than one plot subjected to a given treat¬ 
ment in a block and so to estimate the chance variation of results for a plot within 
a block. In the present work, as is discussed in § 2, two plots were subjected to a 
given treatment in each block and this is probably good practice. 



Table 8. The transformation x’ = Jc~l sinb~ 3 {kx)^ 
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262 Transformation of data from entomological field experiments 

In the special case where there are two plots for the ith treatment (i — 1, ■ ,.,n) 
in the jth block ( j = 1,.. N), and so two observations, x m and x m , the estimate 
of the mean will be written and of the squared standard deviation 

s % ^ 

Then from (9), we estimate k from 

h = a( £ i (%i-%*)“- £ S(£ S (*iii+*«s) 2 ] (* 4 ) 

i=U=i I U=ij=i 1 

and the calculation is very light. 

5. Results showing the effect of the transformation 

ON THE VARIABILITY OF DATA 

The adequacy of our proposed transformation may be judged in two ways: 
first, with respect to its effect, which we shall consider in the present section, on 
the differences between repetitions of a treatment within a block, and secondly, 
with respect to its effect, which we shall consider in § 6, on the behaviour of the 
quantities submitted to the analysis of variance. 

It is a fundamental assumption in the analysis of variance that the chance 
variability for each plot shall be, when the effect of block and of treatment are 
removed, normally distributed with a standard deviation common to all plots, 
in which situation of course the standard deviation of the chance variability for 
a given plot is independent of the expectation for that plot. In the data of the 
present work, where each treatment is repeated in each block, it is possible to 
examine the estimates of this standard deviation, s {}> and of the expectation, x i} . 
For a clear graphical illustration of the situation consider Fig. 2, as obtained from 
the original data of Experiment III on Leptinotarsa decemlineata, where is 
plotted against and contrast this situation with that obtaining for the 
corresponding quantities s' i} and obtained after transformation (Jc = 0-08) 
in Fig. 3. 

In Fig. 2 the points are widely scattered as is natural from a sample of two; 
nevertheless, it is apparent that for the smallest values of x i)t the values of s i} 
are correspondingly small and fall in a close group. In Fig. 3 the cluster of obser¬ 
vations in the lower left-hand comer of the previous diagram has disappeared, 
and generally the scatter appears to be independent of x' ijy , so that apparently 
the transformation gave.satisfactory results, The nature of the material involved 
is such that it does not seem possible to examine the relationship under con¬ 
sideration more exactly, nor to summarize exactly the corresponding results for 
the other treatments; it can only he said that the same type of result appeared 
although the magnitude of the relationship before transformation depended on 
the magnitude of the differences between the effects of treatments. 

The results shown in Figs. 2 and 3 suggest that the proposed transformation 
has tended to make the standard deviation independent of the mean, in accordance 
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with the assumptions underlying the analysis of variance. In using this procedure 
one actually assumes, more broadly, that a common standard deviation exists, so 
that the homoscedastieity of observations before and after transformation should 
be tested. Thus it is assumed that x ii% and x ii2 are observations from a normal 



0 100 200 300 400 500 



Fig. 2. The standard deviation and mean as estimated from plots by pairs, 
with untransformed data on Leptinotarsa decemlineata. 



Fig. 3. The standard deviation and mean as estimated from plots by pairs, with the transformed 
data on Leptinotarsa decemlineata Say, i.e. using x' = k~i sinh -1 (for)*, (k =0-08). 


population with a 


standard deviation, <r, which is independent of i and j. Then 
is distributed as x 2 with one degree of freedom.-)- Accordingly, 


f In the L l test, discussed by Nayer (1936), this case of estimates of standard deviation with 
one degree of freedom is troublesome since zero values tend to arise when dealing with grouped or 
integral observations. When this is the case L t , which is the ratio of an arithmetic to a geometric 
mean of sums of squares, cannot be calculated. The present treatment may therefore have a wider 
application. 


17-3 
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Vij = (%-%a)/V 2(r should be «iistributed normally with unit standard devia¬ 
tion for all i and j. In order to test the hypothesis of normality with unit standard 
deviation it is only necessary to test for leptokurtosis; for the distribution must 
be symmetrical since the sign of differences, and therefore of y it -, is a matter of 
chance. Since the number of items involved will almost certainly be < 100, and 
since the population mean is zero, the w n criterion of Geary (1935) will provide an 
appropriate test. In using this criterion we must find the ratio of the mean 
deviation to the standard deviation, i.e. 


I n N 

w n ~ S 2 | x in ~~ I 
U=u=i 


n N 


nN% 

i=l)=l 


(15) 


Of course, values of w n may be calculated for transformed data by substitution 
of x' m for x m . 


Table 9. The w n test on the homoscedasticity of counts by plots 
within a block for six field experiments 





Lower 

,*% 

limit 



Value of k 

Transformed 

(Beall) 

“n 

Departure 
by a.D. 


Departure 
by s.D. 

Esti¬ 

mated 

Em¬ 

ployed 

w » 

Departure 
by s.d. 

i 

70 

0-841 

0-757 

0-6059 

-5-33 

0-7525 

-1-91 

Hi 

im 

0-7846 

-0-64 

ii 

70 

0-841 

0-757 

0-7886 

-0-49 

0-7807 

-0-79 

B| ? 

Bfr B 

0-7499 


hi 

28 

Hil: • 

0-737 

0-5973 

-6-33 

0-6431 

-4-15 

mi 5 V 

E29 

0-6838 

-3-11 

IV 

24 

■if: \ ’■ 

0-732 

0-5692 

-6-67 

0-6948 

-2-B7 

B £:rfl 

Kr jfl 

0-7370 

-1-66 

V 

18 

■»?: : ■ 

0-728 

0-7554 

-1-16 

0-8155 

+ 0-13 

B « 

BJjJ 

0-7823 

-0-68 

VI 

36 

KH 

0-745 

0-7959 


0-8166 

+ 0-38 

Hi 

B 

0-8130 

+0-28 


Values of w n from the untransformed observations and from the transformed 
observations, both following Bartlett (i,e. the transformation (£+£)*) and 
following the line suggested in the present paper, are shown in Table 9 for the 
field data of Tables 1-6. For the second transformation the values of k as calculated 
from (14) are shown as well as the nearest value of k entered in Table 8. For 
each experiment the value of nN and also the 0-05 limits of probability, from 
Geary (1935), are shown. There are also shown the departures of observed w n 
from the expected value in terms of the standard deviation, a useful criterion 
since the distribution of w n is almost normal. From Table 9 it can be seen that 
out of the three experiments in which w n fell beyond the lower 5 % limit of 
probability for the untransformed data and the data transformed as (#+!)*, 
in only one experiment did w n fall so with the final transformation. The results 
for Experiment II, in which w n is decreased by the transformation, are peculiar, 
Consideration of the departures from the mean in terms of the standard devia¬ 
tion indicates more clearly the improvement effected by each transformation 
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and how the transformation suggested in the present work secures an improve¬ 
ment of the same, but more marked, character than that secured from the 
transformation of Bartlett. The results suggest that while homoscedasticity may 
not be attained always, it will be approached by means of the proposed trans¬ 
formation. 

6. The effect of the transformation on the analysis 

OF VARIANCE 

As was indicated at the beginning of § 5, our proposed transformation besides 
making the variability within a block for a repeated treatment the same for all 
treatments and blocks, should also provide quantities satisfying the assump¬ 
tions underlying the analysis of variance. Since it is not quite clear how, in so far 
as the transformation is satisfactory in the first way, it will necessarily be satis¬ 
factory in the second, it will be well to consider directly the suitability of our 
transformed values for the analysis of variance. 

In the application of the analysis of variance one would deal with x {]t rather 
than with x ijk and suppose that 

x a. — A + B i + Gj + Dy, (16) 

where A is a contribution from the general level of population on the experi¬ 
mental area, jB 4 the contribution of the ith treatment and 0 } the contribution of 
the jth block. The remainder term, D i]} is called the interaction of treatments and 
blocks. Of course, the present discussion on the untransformed values, x ip , 
holds for the transformed values, when the appropriate 

symbols, ABp 0) and D' i} are used. 

In material satisfying the conditions underlying the analysis of variance, for 
the observations under each treatment, the calculated squared standard devia¬ 
tion is in 

4 = ^ (17) 

Following the argument of the analysis of variance, x i} _ - x i 7 of which the mean 
is 0, is an estimate of G t + D ijt in which the two terms are independent; hence the 
expectation of s'j is ^ ^ + crf^, ( 18 ) 

where cr oj and cr Dij are the standard deviations of the parameters, and D ip 
respectively, and are independent of treatment. Accordingly, a t should be in¬ 
dependent of treatment and distributed as an estimate of <r { , having N — 1 degrees 
of freedom. Conversely, if x ip cannot be built of the independent terms of (16), 
then the various values of s t will not be distributed as estimates of a single 
standard deviation. The hypothesis that the values of a { in any one experiment 
are estimates of one quantity may be tested.^ 

f From correspondence with Dr R. W. B. Jaokson, the writer has learned that he had arrived 
independently at the same test. 
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The results of the tests on the homogeneity of the values, s it within the six 
experiments treated in the present paper are presented in Table 10, where the 
value of the L x criterion is shown for the original data and for the transformed 
values together with the appropriate 0-05 and 0-01 levels of probability (Nayer, 
1936). From Table 10 it can he seen that of the values of L x obtained from the 
original data, all but one are near or beyond the 0-05 level of significance, but 
that after transformation all are moved in to less significant values. Accordingly, 
the values of s { , when calculated from the original data, appear heterogeneous hut 
the corresponding values obtained after the transformation appear homogeneous. 
Thus it is more probable that the analysis of variance is applicable to the trans¬ 
formed data than to the untransformed. 


Table 10. The homogeneity, as measured by the criterion L t , of the estimates 
s t for various values of i before and after transformation 



Experiment 


2 

3 

B 


6 

L 1 before transformation 

0-861 

0-833 

0-325 

0-657 

0-344 

0-630 

L x after transformation 

0-864 

0-941 

0-766 

0-688 

0-813 

0-730 

1 % limit 

0-167 

0-767 

0-604 

0-642 

0-514 

0-683 

5% limit 

0-812 

0-812 

0-707 

0-656 

0-648 

0-673 


In Table 10 we have tested the homogeneity of the estimates, s it as in § 5 we 
tested the homogeneity of s i} , that is without reference to the values of the 
associated means. In view of our original assumptions we are, however, interested 
in the possibility that the standard deviations, as calculated, might show every 
sign of being estimates of a common standard deviation and yet be dependent 
on the associated means. Accordingly, we have investigated such dependence 
roughly by fitting by least squares a first order regression of s t on From this 
fitting we record the sign of the regression as follows: 



Experiment 


B 

2 

3 

4 

5 

6 

Before transformation 

+* 

.f** 

+* 


+ 

+ 

After transformation 

- 

4* 

- 

B 

+ 

+ 
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By a single asterisk we have indicated cases where the reduction in variability 
effected by the regression passed the 5% probability limit and by a double 
asterisk where it passed the 1 % limit. Several points may be noted. (1) In two 
cases (Experiments 2 and 6) after transformation the residual sum of squares 
about the regression was greater than the reduction in squares due to the regres¬ 
sion, whereas it was consistently less before transformation. (2) As can be seen 
above, the regression generally did not effect a significant reduction in variability 
after transformation but did before (the small number of degrees of freedom made 
high significance difficult of attainment). (3) After transformation the sign of 
the regression seemed to be a chance matter, whereas before transformation it 
was consistently positive. These results suggest that the transformation pro¬ 
posed did tend to make the variability within a given treatment independent of 
the mean for that treatment. 

7. The effect of transformation upon the conclusions 

FROM THE ANALYSIS OF VARIANCE 

It has been shown in §§ 5 and 6 that the analysis of variance can be made on 
entomological data when a suitable transformation has been effected. It is of 
practical interest to see what numerical effect such transformation will have upon 
tests on the significance of, say, the effect of treatment and the significance of 
differences for treatments. 

Eirst, consider the numerical results to be obtained from the analysis of 
variance (1) without and (2) with transformation. Thus the mean square asorib- 
able to blocks, treatments and their interaction is shown in Table 11, for six 
experiments of which the data are given in §2; parallel results are presented for 
untransformed observations and for observations transformed by (10) with the 
-values of k from Table 9. To facilitate the comparison of the results, the mean 
square for blocks and for treatments is expressed in terms of the estimate for 
interaction, as the F of Snedecor (1934), and presented in each case. The trans¬ 
formation of the data has modified the conclusions to be drawn from the analysis 
of variance in Table 11, in that there are considerable changes in the criterion, F, 
for treatments or for blocks. In the examples shown the effect of treatments was 
highly significant in all cases and so the changes introduced by transformation 
did not alter the conclusions, as would have been the case for less definite effeots. 

Consider next the effect of transformation on the significance of differences 
between the means for treatments as tested by the criterion, t, calculated with 
such estimates of mean square as the interaction of Table 11. For illustration, 
values of t, from the data on Leptinotarsa decemlineata (Experiment III), are 
shown in Table 12 for each possible comparison of treaments when untransformed 
data are used, when the transformation, [x + 1)1, as suggested by Bartlett (1936a) 
is used, and when the transformation, sinh -1 (for)*, as suggested in the present 
paper is used. In order that the influence of the level of population under each 



Table U. The analysis of variance of untransformed and 
transformed data in six experiments 


Variation 

Degrees 

of 

freedom 

Untransformed data 

Transformed data 

Mean 

square 

F 

Mean. 

square 

F 


Experiment I. P. nvbilalis 



Between blocks 

9 

92'8 

1-21 

0-666 

1-66 

Between treatments 

6 

2,839'0 

36-96 

7-61 

22-03 

Interaction 

64 

76 '8 

— 

0-341 

— 


Experiment II. P. nuhilalis 



Between blocks 

9 

677-0 

6-72 

6-37 

10-65 

Between treatments 

6 

1,721-0 

20-04 

8-69 

17-07 

Interaction 

64 

85-9 

— 

0-609 

— 


Experiment III. L , decemlineata 



Between bloolts 

6 

20,172-0 

2-26 

4-67 

3-03 

Between treatments 

3 

390,932-0 

43-77 

111-6 

72-16 

Interaction 

18 

8,931-0 

— 

1-64 

— 


Experiment IV. L, decemlineata 



Between blocks 

6 

12,960-0 

2-06 

2-06 

1-96 

Between treatments 

3 

124,064-0 

20-40 

20-40 

19-41 

Interaction 

16 

6,727-0 

— 

1-05 

—• 


Experiment V. P. quinquermculata 



Between blocks 

6 

27-4 

2-74 

0-349 

4-19 

Between treatments 

2 

762-0 

75-33 

19-6 

233-77 

Interaction 

10 

9-98 

-: 

0-083 

— 

Experiment VI. P. quinquemaculata 



Between blocks 

5 

41-7 

4-13 

1-50 

4-12 

Between treatments 

6 

66-2 

6-66 

3-33 

9-14 

Interaction 

25 

10-1 

'- 

0-365 



Table 12. The values of t, in the comparison of means, as calculated from the 
untransformed and the transformed data of Experiment III on Leptinotarsa 
decemlineata 


Comparison 

Means for 
untransformed 
data 

t -without 
transformation 

t from 
(*+»* 

t from 

& -i sinb" 1 (for)* 

*x 

362 

30 

+ 9-27** 

+ 11-33** 

+ 9-92** 


362 

224 

+ 3-84** 

+ 3-62** 

+ 2-11* 


362 

II 

4-9-82** 

+12-89** 

+12-46** 

* 2 . ,-* 3 .. 

30 

224 

-6-43** 

- 7-81** . 

- 7-81** 

x 2.r x i.. 

30 

11 


+ 1-66 

+ 2-64* 

*0. 4 .. 

224 

11 

4-5-98** 

+ 9-37** 

+ 10-36** 
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treatment may be judged, there are shown, also in Table 12, the means for the 
untransformed data. The values of t falling beyond the 0-01 level of significance 
have been marked with two asterisks and the values beyond the 0-05 level with 
one. It can be seen that the transformation resulted in a profound alteration in 
the conclusions. Apparently oh account of the dependence of variance on mea'n 
in untransformed data, the pooled estimate of variance was orig inal ly too low 
for the treatments which resulted in high populations and too high for the treat¬ 
ments which resulted in low populations. Thus, in the comparison of the first 
and third treatments, which appeared to have the two highest surviving popula¬ 
tions, the value of t calculated from untransformed values was high. In the other 
extreme case, the comparison between the second and fourth treatments, the 
value of t, as calculated from untransformed values was very low. It can be seen 
further, that the first transformation only secured in part the modification in 
the value of t that was secured by the second transformation. 

8. The procedure of transformation in practice 

The methods which were found applicable in the preceding discussion will 
now he illustrated in the transformation of the data shown in Table 7 (Experiment 
VII) on Phlegethontius quinquermculata, of the same type as the experiments 
previously discussed in the present paper. The steps in the analysis will he set out 
with the purpose of providing a model for procedure in estimating the constant, 
k, which will be used to effect a transformation,of the data so that the analysis 
of variance may be made. 

Supposing that the experiment has been laid out with a repetition of each 
treatment in each block, the procedure of estimating k makes it first necessary to 
find the sum and the absolute difference of each pair of plots subjected to a given 

n N 

treatment in a given plot andthen to sum the sums, 2 2 i x iji + x az)> and the sums 

n N n jv 

squared, 2 S(%i+% 2 ) a > and also the differences squared, 2 2(% i~ x ij 2 f> 

i=lj = l {= lj=l 

over all such pairs and by substituting the results in (14) to find k. In the case 
being used for an illustration the two plots subjected to the first treatment in 
each block gave respectively 10 and 7, 20 and 14,14 and 12,10 and 23,17 and 20, 
14 and 13, so that 

2 (%i + *W= (10 + 7) + (20+14)+ (14+12)+... = 174. 

Similarly, 

S(®ui + *i«) 2== (10 + 7) 2 + (20+14)H... = 5308 
and similarly, 

S(%-%) ! = (10-7)2+(20-14)2+... = 228. 
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Of course, in estimating k the summations are not limited to one treatment but 
must be extended over all in the experiments. If this is done we And 

i S (Ztfi+%2) = 684, S S (*W + x rn? = 19,656, S | (* M1 - *y 2 ) 2 - 708. 

r=l3'=»l t=lj-l 

From (14) we estimate k = —^ —- - 0-002, 

and referring to Table 8, p. 250, use k = 0-00 as the nearest value occurring there. 
Of course, in this case, the transformation is simply xK 

Now from the above result it will be possible to replace the observed values of 
Table 7 with the corresponding transformed values from the first column of 
Table 8. Thus in Table 7 replace in the first row: 10, 20, 14, 10, 17 and 14, by 
3-16, 4-47, 3-74, 3-16, 4-12 and 3-74. With such transformed values we can now 
proceed to carry out a routine analysis of variance which will be facilitated by 
working with the sum for each pair of plots in a given block with a given treatment. 
For example, the final analysis of variance for Experiment VII would be carried 
out with the values of Table 13. 


Table 13. Transformed and summed values to be used in the analysis of 
variance for Experiment YU on P. quinquemaculata 


Treatment 

Block 

1 

2 

1 

4 

5 

6 

1 

6-81 

8-21 

7-20 

7-90 

8-69 

7-35 

2 

7-44 

7-90 

7-74 

8-24 

8-94 


3 

1-00 


2-73 

2-41 

1-73 


4 

3-97 

5-91 

3-73 

4-48 

4-48 

3-41 

5 

3-97 

3-97 

4-18 

2-00 

3-14 

4-45 

6 

6-32 

8-56 

7-87 

6-77 

10-20 

8-51 


9. SUMMABY AND CONCLUSIONS 

The foregoing work is a study of experimental results from seven field experi¬ 
ments on the control of insects. In such data, the standard deviation of the 
number of insects per plot varies with the mean. By the transformation, 
x' — sinh -1 (fcr)t, where k is a constant and x an observation, the data were 
put in a form for which the standard deviation approached a constant independent 
of the mean. The estimation of the one constant, k, necessary for the transforma¬ 
tion was made possible by the design of the experiments with repetition of treat¬ 
ments within blocks. In practice, the transformation gave good results so that 
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analysis of variance could be made. From the analysis of the transformed data, 
the results were found to differ markedly from those which would have been 
obtained from the untransformed data. 
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APPENDIX 

As has been said, the transformation of (10) was suggested by the method used 
by Tippett (1934, p. 61), The procedure is as follows. 

It is required to find x' such that the standard deviation, n x , of x', 

shall be approximately constant. Let us write 

x' =f(M)+f'(M)(x-M) + ..., (19) 

where M is the expectation of x and whence, approximately, 

(x'-M') =f'(M)(x-M), (20) 

where M' is the expectation of x' . Hence 

< 7 % = ( 21 ) 

where <r is the standard deviation of the observations, x. Replacing.oy. in (21) 
by a constant, c, as is the purpose of our operation, and substituting for <r from 
equation (6), p. 247, we have 

f{M) = c{M+kM*)~i, (22) 

where h is, as has been previously discussed, a constant peculiar to our data. 
Integrating in (22), 

f{M) = 2cJfc-*sinh- 1 (hif) i . (23) 

From (23) the form of the function suggested is sinh -1 (kx)*, but it is wise instead 
to use fc-l sinh -1 (kx)*, since the transformation then becomes identical, as shown 
in (11), with the established transformation, ad, when k = 0. 

As Tippett (1934) says: ‘This derivation is not mathematically sound, and 
the result is only justified if on application it is found to be satisfactory.’ The 
writer would have hesitated to have used it had it not already led to useful 
transformations in oases analogous to the present, namely to ad where x comes 
from a Poisson distribution, to sin -1 p* where p comes from a binomial distribu¬ 
tion and, according to Tippett, to tanh -1 r, where r is the correlation coefficient. 
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INTERPOLATION EOR FRESH PROBABILITY LEVELS 
BETWEEN THE STANDARD TABLE LEVELS 
OF A FUNCTION 

By J. B. SIMAIKA 
1. The problem 

A number of tables of probability functions exist, and more will no doubt before 
long be available, giving values for a variable x corresponding to a limited number 
of simple probability levels a. How far is it possible to. obtain x rapidly for inter¬ 
mediate values of a? 

The variable may be put into standardized form as the ratio of the deviation 
from the mean to the standard deviation; these two quantities (i.e. the mean and 
standard deviation) are often easy to obtain, whereas the probability integral 
may require extensive computation. Denote by the standardized variable 
so that 

_ x a — mean x , . 

“ ~ standard deviation of*' 

The question we shall consider is this: having full and accurate tables relating 
u a and a for a standardized normal variable, denoted by U a , can we use these 
values as auxiliary in obtaining u a for any other function tabled only at a few 
probability levels and, having found an interpolating formula, what is its accuracy ? 
In examining this point we shall compare the accuracy of the method with that of 
some other methods of deriving intermediate values of u a . 


2. General approach 


It will be useful to consider first how far a general theoretical approach will 
take us. Let the variable x follow a probability law defined only by its cumulants 
K r (r = 1,2,...); then the first four cumulants of the variable % become 0, l,y v y 3 . 
It is known that the relation between u a and a may be written symbolically 


«<x 


>V(2n) 


i r . d 3 , & 

exp[^-iy 1 ^ 3 +^y axr4 +. 


l dx 3 




e~ ix '~ dx, 


while the same relation for a normal variable is 


( 2 ) 


a = 



e-i^dx. 


( 3 ) 


Using equations (2) and (3) it has been shown by Cornish & Fisher (1937) 
that u a can be approximated to by a parabolic curve in U a and vice versa. If we 
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assume that K r for r Js 5 is negligible, this expression, using a fifth degree parabola, 
is u.~A + BU lt +OVl+DUl + JBU* + l , Ul (4) 

where A ?= -iyi-TtYiVa+^'y!, 

R -1 - |y*+Ar! wryl ~ iMrlrz - ffHrf. 
c = Iri+Arira-^y?. 
b = ^rs-Tjr! +tV/2 -iSt7Sy*+lttyt 
•®= ~Ayiy»+Ao'!. 

-F = -iibyl +i?4yiys'”'?f5yi- 

Now as y 1 and y 2 tend to zero, B tends to unity and all other coefficients tend 
to zero, i,e. the curve of u a as a function of Z7 tt tends to the diagonal line 

V-*4. 

Figs. 1-4 give these curves for different numbers of degrees of freedom for the 
commonly used statistical variables ^ 2 , x, t and v (a transformation of z referred 
to below), expressed in standardized form. 

With regard to the coefficients in (4) it may be remarked that large values of 
y 2 do not increase them as much as large values of y v Furthermore, when y 1 is 
zero, the coefficients A, C and IS vanish and the expression (4) becomes 

u a ~ BU a +DUl + FUl, 

or p~B + r>Ul+F{U%}*. (5) 

u a 

These broad results suggest that a good method of interpolation, when both y x 
and y 2 exist, is a Lagrange formula through the points 

Ki’UJ (*-1,2,...). 

Remembering that u a . ~ (a; a .-meana;)/(s.r). of x), from the practical point of 
view the interpolation can be carried out more expeditiously and without loss 
of accuracy by using a Lagrangian formula through 

(®o,»*4|) (*-1,2,...). 

When, however, y 1 is zero as in the case of the f-distribution it would be better to 
take the Lagrange formula through the points 

(jf . or alternatively through ^5*, D« J (< — 1,2,...), 

the mean * being zero. 

The accuracy of the method is likely to depend on the value of y 1 and y z . 
For example, as seen in Fig. 1, linear interpolation between, say, t/ 0 . 05 and U m2 
will be more accurate with v = 18 than v = 3, the y’s being smaller in the former 
case. Again, the curves will be more nearly linear if we take as variable 

u = (X - mean x)ja x rather than n = (y 2 - mean ^*) jay 
because the y’s for the former are the smaller. 



J. B. SlMAIItA 


265 


For the practical worker it will often be sufficient to use linear interpolation, 
i.e. to make use of two tabled probability levels only. For more accurate work 



Standardized normal variate, V 
Fig. 1. Relation between it(/) and U. 

three or more levels can be used, but an increase beyond this is not, in fact likely 
to lead to a gain in accuracy which will be worth the labour. 
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3. Comparison of methods 
To compare the accuracy of the interpolation based on the polynomial 
expansion (4) with that obtained by other possible methods, we have considered 
the y a , t and v (beta)-probability distributions. For each, different methods of 
interpolation have been devised. In some a transformation of the variable has 



been used, while in others a transformation of the argument or a completely new 
argument has been considered. 

The accuracy of each method in the range covered by the values a <0-10 
and cc Js 0'90 has been tested in the following way: between any two consecutive 
probability levels a number—not less than three—of intermediate values have 
been interpolated. These values were chosen to be those which could be obtained 
accurately from some other table. The greatest deviations in each interval are 
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given in Tables 1—3. The methods have been arranged according to the degree 
of accuracy obtained. 



It remains to point out that, if the n tabulated probability levels are denoted by 

a 1 <a a <... <«„^0-10, 

and if the interpolation is to be carried by using a quadratic or higher expression 
in the argument, it has been found that the interpolated value is always more 

Biometrika xxxn x8 
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accurate when the probability levels used include as many as possible of the 
probability levels below a x . Similarly, if 

0-&0 < a x < a 2 <... < a n , 

it is better to use probability levels including as many as possible of those above a x . 


-1-0 -1-5 -2-0 -2-5 



Standardised normal variate, V 
Fig, 4. Relation between u ( v ) and U , where p ( v ) = WV- 1 (1 — 


4. The ^-probability tonotxoh 
The standardized form of yf used in Pig. I is 


u(x*) = 


V(^) ’ 


( 6 ) 


v being the number of degrees of freedom. The y x and y 2 of this distribution are 
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V( 8 M and 12 1 p respectively. If we consider % itself instead of x 2 we find that its 
standardized form has the approximate value, used in Fig. 2, 


«(X) = 


1-jr 1 ’ 


(7) 


and its y x and y 2 are (2p)~t + 0(v 2 ) and 0(f~ 2 ) respectively. Both these last 
quantities are smaller than those for x 2 > which suggests that interpolation will 
be more accurate using x rather than y 2 . This can also be seen from Figs. 1 and 2. 

The x 2 probability levels have been tabulated by Fisher (1941) for a = 0-01, 
0-02, 0-05, 0-10, ..., 0-90, 0-95, 0-98, 0-99 and are given to three decimal places. 
These levels were used in the interpolation.* The accuracy of the interpolated 
values obtained by the eight methods detailed below was checked either from the 
Tables of the Incomplete Gamma Function (Karl Pearson, 1922) or from Tables 
or Statisticians and Biometricians, Part I, Table XII (Karl Pearson, 1930). 

The greatest deviations, (m = 1, 2,..., 8), obtained using in all eight different 
methods, are given for v = 3,5, 9 and 18 and for intervals of a: (0-01,0-02), 
(0-02,0-05), (0-05,0-10), (0-90,0-95), (0-95,0-98) and (0-98,0-99) in Table 1. 


Method 1. 

X z -A + B\oga (a<0-10), x 2 = -4 + Blog(l-a) (a>0-90). 
Method 2. 

•m(x) == A + B log a (a<0-10), u{x) = .d-f Ulog(l —a) (a>0-90). 
Methods. uffl'l — A + BU. 

Method 4. 

u(x 2 )~G = A +Bloga (a<0'10), «(X 2 )“ U = + -Blog(1 -a) («>0-90). 

Method 5. «(x) — A + BU. 

Methods. x 2 = ^ + & log a 4- Clog 2 a (a <0-10), 

X 2 — A + Blog(l—a) + (71og 2 (l —a) (a >0-90). 

Method 7. w(x 2 ) = A + BU + GU 2 . 

Method 8. u(x) = A + BU + CU 2 . 

The best linear interpolation is that provided by method 5 and the best 
quadratic one is given by method 8. Both these methods are those suggested by 
the general approach. 


* Certain additional levels are given in a recently published table computed by Catherine M. 
Thompson (1941). 
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Table 1. Greatest deviations in the interpolation of x l and X- 


V 

7i 

h 

Interval 



tel 




tel 

Kl 

tel 

tel 

3 

1-63 

4-00 

(0 02, 0-05', 

0-02 997 

0-245 

0-006 

0-007 

0-011 

0-012 




0-001 




(0-05, 0-10) 

0-07 890 

0-490 

15 

8 

10 

i 

4 

4 

3 

0 




(0-90, 0-95) 

0-92 810 

7-000 

4 

3 

27 

14 

5 

1 

0 

1 




(0-95, 0-98) 

0-97 071 

9-000 

5 

33 

36 

10 

6 

0 

0 

2 




(0-98, 0-99) 

0-98 827 

11-000 

1 

13 

34 

5 

1 

2 

1 

2 

6 

1-26 

2-40 

( OOl , 0-02) 

0-01 353 

0-632 

8 

5 

6 

3 

2 

6 

0 

0 




(0-02, 0-05) 

0-03340 

0-949 

23 

13 

15 

3 

5 

5 

1 

0 




(0 05, 0-10) 

0-06150 

1-265 

19 


11 

0 

3 

4 

0 

1 




(0-90, 0-95) 

0-92 808 

10-119 

11 

32 

26 

12 

5 

3 

1 

1 




(0-95, 0-98) 

0-96 544 

12-017 

13 

39 

32 

12 


2 

1 

0 




(0-98, 0-99) 

0-98 579 

14-230 

5 

18 

17 

6 

4 

2 

1 

1 

9 

0'94 

133 

(0-01, 0-02) 

0-01 060 

2-121 

4 

2 

2 

2 

0 

1 

1 

■o 




(0-02, 0-05) 

0-03 452 

2-970 

34 

21 

19 

0 

6 

6 

0 

1 




(0-05, 0-10) 

0-07 705 

3-818 

33 

24 

17 

6 

5 

8 

1 

0 




(0-90, 0-95) 

0-93 312 

16-000 

19 

38 

17 

11 

5 

4 

0 

0 




(0-95, 0-98) 

0-96 483 

18-000 

22 

46 

31 

12 

7 

4 

1 

1 




(0-98, 0-99) 

0-98735 

21-000 

8 

18 

14 

5 

4 

2 

1 

0 

18 

0-67 

067 

(0-01, 0-02) 

0-01 167 

7-200 

14 

9 

5 

2 

1 

4 

1 

1 

m 



(0-02,0-05) 

0-02 793 

8-400 

47 

32 

20 

2 

5 

7 

0 

1 

1 



(0-05,0-10) 

0-07482 

10-200 

48 

34 

18 

6 

6 

11 

1 

0 

■ 



(0-90, 0-98) 

0-93 159 

27-600 

34 

52 

25 

10 

5 

7 

1 

1 

■ 



(0-95,0-98) 

0-96 265 

30-000 

34 

55 

28 

10 

6 

6 

1 

1 

1 

■ 


(0-98,0-99) 

0-98 689 

33-600 

15 

26 

13 

5 

4 

3 

1 

1 


For methods associated with subsoripta to 5, bob p, 269. 


Table 2. Greatest deviations in the interpolation of t 


B 



a 

True t 

tel 

tel - 




tel 

m 

tei 

3 

CO 

(0-005, 0-01) 

0-00 692 

5-198 

0-039 

0-040 

0-035 

0-020 

0-011 

0-008 

0-007 

0-004 



(0-01, 0-025) 

0-01 430 

3-969 

64 

52 

42 

21 

15 

7 

6 

2 



(0-025,0-05) 

0-03 261 

2-848 

28 

24 

16 

7 

8 

5 

3 

0 



(0-050,0-10) 

0-07 285 

1-954 

25 

23 

11 

4 

7 

3 

3 

1 

6 

3-00 

(0-005,0-01) 

0-00 714 

3-413 

11 

8 

5 

3 

2 

1 

1 

1 



(0-01, 0-025) 

0-01 517 

2-820 

18 

14 

6 

3 

2 

1 

1 

1 



(0-025,0-05) 

0-03 874 

2-128 

8 

6 

1 

1 

2 

2 

0 

1 



(0-050,0-10) 

0-06 820 

1-719 

9 

8 

1 

1 

1 

1 

0 

0 

10 

1-00 

(0-006, 0-01) 

0-00 625 

3-038 

5 

4 

1 

2 

1 

1 

1 

1 



(0-01, 0-025) 

0-01 639 

2-476 

8 

11 

1 

1 

1 

1 

1 

1 



(0-025, 0-05) 

0-03 844 

1-972 

4 

3 

2 

1 

I 

1 

0 

0 



(0-050,0-10) 

0-07 246 

1-581 

4 

4 

5 

0 

1 

1 

0 

0 


For methods associated with subscripts to 8, see p. 271. 
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5. The {-probability function 

The standardized form of the {-probability function used in Fig. 3 is 

um-t (8) 

v being the number of degrees of freedom. The values of y 1 and y 2 are zero and 
6/( v — 4) respectively. 

Percentage levels for t have been tabulated by Fisher (1941) for a = 0-006, 
0-01, 0-025, 0-05, 0-10, ... and are given to three decimal places. The level here 
defined as a is half the figure given in Fisher’s table, i.e. 


a = 



(9) 


The values of t used in checking those obtained by interpolation are taken 
from Tables of the Incomplete Beta-Function (Karl Pearson, 1934), where 

4 iP> T) = a, 

1 ( 10 ) 
P ~ i v > 4 = i and x = Y+¥Jv 
v being the number of degrees of freedom. 

The greatest deviations found in the following eight methods are denoted 
by S m (to = 1, 2,8), and are given in Table 2 for v — 3, 6 and 10 and for the 
intervals (0-005,0-01), (0-01,0-025), (0-025,0-05), (0-05,0-10). 


Method 1. 

u(t) = A + BU. 

Method 2, 

u(t) — U = A + B log a. 

Method 3. 

t = A + B log a. 

Method 4. 

u(t) = AU + BV S . 

Method 5. 

w(t) = A + BU + CU*. 

Method 6. 

t — A + B log a + C log 2 a. 

Method 7. 

u(t) — U — A + -Blog a + Clog 2 a. 

Method 8. 

u(t) = AU + BU 3 + CU\ 


From the methods that make use of only two probability levels, method 4 
is the one that gives the highest accuracy and from those that make use of three 
probability levels, method 8 is the best. These two methods are those suggested 
by the general approach, 


6. The v or beta-distribution 
This variable> which is related to R. A. Fisher’s z, is defined as 


v = 


S t + S 2 v 2 -t v^e 2 * 


( 11 ) 
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where 8 1 is a sum of squares of normal variates based on v t degrees of freedom and 
8 1 an independent sum of squares based on v 2 degrees of freedom. The elementary 
probability law of v is 

( 12 ) 


Table 3. Greatest deviations in the interpolation of v 


B 

B 







l*.l 

14.1 

Kl 

| 4 .| 

14.1 

| 4 ,| 

i 

u 






0-00043 



0-00007 

0-00006 


■1 





( 0 - 01 , 0 - 05 ) 

0-02 607 

0-59000 

296 

sBSwil 

34 

35 

■7 

27 

1 





( 0 - 05 , 0 - 10 ) 

0-06 901 

0-66000 

79 

5 

28 

14 

6 


1 

8 

30 

- 0-61 

+ 0-25 

( 0 - 005 , 0 - 01 ) 

0-00 789 

0-53000 

25 

4 

7 

16 

9 

5 

8 





( 0 - 01 , 0 - 05 ) 

0-02 711 

0-59000 

228 

36 

24 

35 

4 

13 

1 





( 0 - 05 , 0 - 10 ) 

0-06 645 

0-64000 

67 

4 

16 

16 

2 

5 

2 

15 

30 

- 0-28 

- 0-12 

( 0 - 005 , 0 - 01 ) 

0-00666 

0-41000 

31 

3 

3 

8 

9 

7 

8 





( 0 - 01 , 0 - 05 ) 

0-02 826 

0-47000 

240 


22 

42 

3 

9 

3 





( 0 - 05 , 0 - 10 ) 

0-07 463 

0-52000 

78 

9 

1 


1 

9 

1 

8 

15 

- 0-33 

- 0-26 

( 0 - 005 , 0 - 01 ) 

0-00677 

0-30000 

53 

19 

H3 

13 

3 

3 

3 





( 0 - 01 , 0 - 05 ) 

0-02 519 

0-37000 

418 

128 

114 

67 

11 

22 

2 





( 0 - 05 , 0 - 10 ) 

0-06 972 

0-44000 

126 

27 

16 

29 

2 

15 

3 

4 

5 

- 0-16 

- 0-77 

( 0 - 005 , 0 - 01 ) 

0-00 796 

0-09000 

80 

32 

58 

29 

16 

5 

6 





( 0 - 01 , 0 - 05 ) 

0-02 723 

0-15000 

838 

375 

534 

181 

93 

19 

12 





( 0 - 05 , 0 - 10 ) 

0-07 421 

0-23000 

278 

123 

144 

78 

33 

8 

11 


Sor methods associated with subBoripts to d, see p. 273. 


Low values of v correspond to high values of z, The standardized form of v 
used in Tig. 4 is 

v -— 



(13) 


and the values of y 1 and y 2 are 


?i = 

r 2 = 


h-h ll Hh+h+2)\ 
iq+r a +4V\ jqiq /’ 

12(r|(r 1 +2) + rj(r 2 +2)-2r 1 r !1 (i/ 1 + u a -i-4)} 
>Wv, + v, + 4)(v 1 + ji, + 6) 


(14) 


The cases considered here are those moat generally met in tests of significance 
with jq < jq and therefore y x < 0. y 2 is sometimes positive and sometimes negative. 

Tables giving values of v for <x = 0-005, 0-01, 0-025, 0-05, 0-10, 0-25 and 0-50 
to five decimal places have recently been published (Catherine M. Thompson, 
1941). The corresponding upper probability levels can be obtained by entering 
the tables with jq and iq transposed and taking l~v for v. 
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The values of v used in checking the interpolation are taken from the Tables 
of the Incomplete Beta-Function (Karl Pearson, 1934). 

The greatest deviations S m (m = 1,2,7), found in the following seven 
methods are given in Table 3 for the following pairs of values of iq and v 2 : 
(4,20) (8,30) (15,30) (8,16) (4,5) and for the intervals (0-005,0-01), (0-01, 
0-05), (0-05,0-10) of a. 

Method 1. 

v = A-\-B\oga (a < 0-10), v = u4 + J81og(l-a) (a >0-90). 
Method 2. 

u{v) - V = A + B log a (a <0-10), u(v)- U = A + JSlog(l-a) (a>0-9Q). 

Methods. u(v) = A+BU. 

Method 4. v = A + -Bloga-f Clog 2 a (a<0-10), 

v ~ ,d-t-.Blog(l — a) + G log 2 (1- a) (a >0-90). 

Method 5. u(v)— U = A +B\oga + C\og 2 oc (a<0-10), 

u(v)-U = J. + 51og(l-a)-fC'log 2 (l-a) (a>0-90). 

Method 6. u(v) = A+BU + CU 2 . 

Method 7. n(v) = A + BU+CU Z + DU 3 . 

Here also the best linear method and the best quadratic one are those sug¬ 
gested by the general approach, namely, methods 3 and 6. Method 7, a cubic in 
U, was used only because the great accuracy of the tabulated ^-function justified 
the computation. This cubic interpolation gives errors of the order of 0-0001 
even for numbers of degrees of freedom as small as iq — 4, iq = 5. If the interval 
between a = 0-01 and ()-05 were broken into two parts at 0-025 (a level given in 
the Thompson tables) the corresponding errors S 3 would be considerably reduced. 

7. Some numerical examples, using linear interpolation 

(a) Interpolation for a y 2 level. Suppose that we calculate the upper 2| % 
level (a = 0-975) for y 2 with v = 5 degrees of freedom, using tabled values of the 
upper 2 % and 5 % levels. We require the 2 %, 2| %, and 5 % levels of the stan¬ 
dardized normal variable as well as the two y 2 levels. The relevant data are shown 
in the following table; as pointed out above, there is no need to calculate the 
standardized form of either y or y 2 . 


a 

u a 

xl 

Xa 

0-98 

2-0537 

13-388 

3-659 

0-975 

1-9600 

? 

? 

0-95 

1-6449 

11-070 

3-327 
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By linear interpolation, i.e. using method 3 of p. 269, we have 

, 1-9600-1-6449 

Xlm = 11-070 + (13-388-11-070) ^ 

= 12-857. 

Interpolating similarly for y, i.e, using method 5 of p. 269, we find 
Xo -976 == 3‘583 or yo- 9?5 = 12-838, 

The correct value taken from Miss Thompson’s table (1941) is 

A &976 = 12-8325. 

It is seen, as expected on theoretical grounds and as evidenced in the comparisons 
of Table 1, that method 6 is the more accurate of the two. 

(b) Interpolation for a t level. Suppose that we calculate the value of t corre¬ 
sponding to a = 0-0125 (as defined in equation (9)), with v — 6, using the tabled 
levels for a = 0-005 and 0-025. The data required are shown below, the values of 
t being taken from the table on p. 300 of this issue. 


a 

u x 

ta 


Ua 

0-006 

-2-6758 

-3-7074 


6-6347 


-2-2414 



5-0239 


-1-9600 

— 2*4469 

.. 


3-8416 


Again, it is not necessary to calculate the standardized values of t a , for even 
when using the relation of method 4, which assumes 

~jj = A + BU\ (15) 

the transference of cr t to the right-hand side of the equation will only modify 
the constants A and B whose values are not directly determined in the inter¬ 
polation process. 

Using method 1 (t a linear function of U), it is found that 

^•0125 = “ 3-023. 

Using method 4 (1/(7 a linear function of Z7 2 ), it is found that 

^0-0125 = ~ 2-979. 

The correct value is - 2-969. As can be seen in Fig. 3, from the stretch of the 
curve for v = 6 between a'= 0-975 and 0-995, the interval chosen is too long for 
satisfactory linear interpolation. The use of formula (15) improves matters, but 
is still not satisfactory. With Fisher’s tables we could, of course, interpolate 
between the levels a = 0-010 and 0-025; doing this with method 4, it is found that 

^0 0125 = ~ 2-971 


a distinctly better value. 
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(c) Interpolation for a beta-distribution percentage level. Take the case v 1 = 8, 
v 2 — 30 and suppose it is wished to find the % level from a knowledge of the 
1 % and 5 % levels. The data required are as follows, v 0 . 01 and « 0 . 05 being taken 
from Miss Thompson’s tables. 


cc 



0-01 

2-3263 

0-54170 

0-020 

1-9600 

? 

0-00 

1-6440 

0-62332 

— 


Then, using linear interpolation, i.e. method 3 of p. 273, we have 

1-9600-■ 1-6449 

%025 = 0-62332+ (0-62332-0-54170) x — ^ 

= 0-68558. 

The correct value taken from the same tables is 0-58582. It will be seen from 
Fig. 4 that the intervals a = 0-006, 0-010, 0-025, 0-050, 0-100 of Miss Thompson’s 
tables are likely to form a satisfactory framework for subtabulation if this is 
needed. 


8. SlTBTABTJLATION OF EXISTING TABLES 

These methods have been used to produce the following enlargements of 
existing tables; but it is not at present possible to arrange for their publication. 

(а) Table of y 2 percentage levels. Method 7 (p. 269) was used, as it is almost as 
accurate as method 8 and less laborious. The table calculated gives y| to 3 
decimal places for v — 1 (1) 30, andfor a = 0-010 (0-002) 0-020; 0-020 (0-005) 0-050; 
0-05 (0-01) 0-10; and for the corresponding levels at the upper end of the distribu¬ 
tion, i.e. for a' — l —a. 

(б) Table of t percentage levels. Method 8 (p. 271) was used for v = 3,..., 6, and 
method 6 for p = 7(1)30;40, 60andl20. Exact levels were calculated for v = 1,2. 
The tables were computed to 3 decimal places and for 

a = 0-005(0-001)0-010; 0-0100(0-0025)0-0250; 0-025(0-005)0-050; 0-05(0-01)0-10. 

9. Interpolation for the probability integral a, given u a . 

As we have already mentioned, the variable U a can be expressed as a poly¬ 
nomial in u a . This suggests that interpolation for U a , given u a , is as easy and as 
accurate as the interpolation discussed above. Hence to interpolate for a, given 
the value of u a and certain tabled levels u ai , u ai , ..., we first interpolate for U a 
and then find the value of a from appropriate tables of the normal probability 
integral, e.g. Tables for Statisticians and Biometricians, Part I, Table II. 
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10. Conclusion 

It has been shown how accurate values of the probability levels of a statistical 
variate may be interpolated between standard tabled values by using the stan¬ 
dardized. normal variate as auxiliary. For many purposes linear interpolation is 
adequate; for others a second order Lagrangian formula may be preferred. The 
accuracy of the result depends, of course, on the closeness of the actual prob¬ 
ability law to the normal law and on the size of the intervals between the tabled 
levels. The method has been illustrated on examples from the y 2 , t and v (beta) 
distributions. 

The method has been used to provide a subtabulation of existing tables of 
percentage points for y 2 and t, but it is not possible to have these tables printed 
with the present contribution. 

Finally I should like to express my thanks to Dr B. L. Welch of University 
College, London, for originally suggesting the problem to me. 
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PARTIAL RANK CORRELATION 

By M. G. KENDALL 


1. In interpreting an observed dependence between two qualities we are 
constantly faced with the question whether an association (correlation) of A 
with B is really due to the associations (correlations) of each with a third quality C, 
This has led naturally to the theories of partial association and correlation, 
which attempt to decide the matter by the consideration of subpopulations in 
which the variation of C is eliminated, An analogous problem arises in ranking 
work but, so far as I know, has not previously been considered. For example, 
if a number of men are ranked according to mathematical and musical aptitude 
and there appears a significant rank correlation, it is natural to inquire whether 
this may be attributable to the correlation of both with Borne more fundamental 
quality such as intelligence. The object of this paper is to propose a coefficient 
of partial rank correlation which has a natural meaning and may be found useful 
for investigations requiring this type of decision. 

2. As a preliminary it may be worth examining what can be done in this 
direction with the Spearman rank correlation coefficient p. If there are three 
rankings denoted by 1, 2 and 3, we may find the three coefficients p 12 , p 13 , p 23 . 
It is tempting to apply to these coefficients the formulae of product moment 
partial correlation such as 


n _ P m ~ PnPu 

Pi31 (1-/>?*)* (Wfs) 1 ’ 


( 1 ) 


and to define p z31 as the partial rank correlation of 2 and 3 ‘when 1 is constant’. 
There is clearly very little justification for such a procedure, and it is far from easy 
to explain just what p 231 means. In fact, the only defence of formula (1) that 
can bear critical examination is, I think, that it is an approximation to a second 
possibility, as follows: 

3. There can be no such thing as a rank correlation in a continuous population 
(the members of which are not even denumerable) but we can speak with genuine 
meaning of a grade correlation. A well-known result due to Karl Pearson states 
that in a normal bivariate population with correlation p p the grade correlation 


p u is s iven *>y 


p p = 2sin-p ff . 


The Spearman coefficient p may be regarded as a sample grade correlation. If, 
therefore, we take pas an estimate ofp g we may find p v from (2). For three rankings 
we may then obtain the three values of p p , apply the ordinary product moment 

.7 tx . 

partial formula, and so obtain a partial coefficient. Since x and 2 sin — do not 
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differ by more than a small amount for j x | < 1 , we might even apply formula 
(1) direct to the values of p without bothering to transform them into p v by 
equation (2), 

4. Such a procedure, again, is open to fairly obvious objections. Apart from 
the all-too-facile assumption of normality and the error involved in using Spear¬ 
man’s p from a small sample to estimate the grade correlation in a parent, the 
fact remains that we arrive, not at a partial rank coefficient, but at an estimate 
of a partial product moment coefficient in a normal population. 

Perhaps there are cases where this is a reasonable objective based on reason¬ 
able assumptions but it is evidently unsatisfactory for general ranking purposes. 

5. In a previous paper (1938) I defined an alternative coefficient of rank 
correlation r which may be generalized to include the case when pairs of in¬ 
dividuals are compared separately (Kendall & Babington Smith, 1940). It will 
be convenient for present purposes to redefine r in a slightly different manner 
so that the results obtained below may again be immediately generalized to the 
case of paired comparisons. Consider the two rankings of six 


1: 1 4 3 2 6 5 

2: 3 2 4 1 5 6. (3) 


There are 



= 15 possible pairs of ranks in each ranking, viz. 12, 13.16, 23, 


24, ..., 56, We write them down as in expression (4) below. Any order of the 
pairs will serve, and it is immaterial whether any pair is written as ab or ba; but 
for practical convenience they may be written in the natural order indicated in 
the last sentence but one. This arrangement I call the recorded order. 

We then consider the occurrence of each pair in the ranking l. If a pair occurs 
in that ranking in the order in which we have recorded it, we write a plus below 
the recorded order underneath the pair concerned; in the contrary case we write 
a minus. Ranking 1 of expression (3) will then give 


Recorded order: (12) (13) (14) (16) (16) (23) (24) (26) (26) (34) (35) (36) (46) (46) (56) 
Ranking 1: — “d-d- — d-d~d*d" — 

(4) 


Here, for example, the pair (15) occurs in that order in ranking 1 and so is 
denoted by a +, whereas the pair (24) occurs as 42 and is denoted by a —. 

Consider now ranking 2. The members of ranking 1 which are ranked 1,2 
correspond to members in ranking 2 ranked as 3,1. This is in the reverse order 
to that of the pair 13 in the recorded order, so (starting a new row of signs corre¬ 
sponding to ranking 2) we write a minus under the recorded pair (12). Similarly, 
the pair in ranking 2 corresponding to 15 in ranking 1 is 36. This is in the same 
order as the recorded pair, so we write a plus below the existing pins under the 
recorded pair 1 (15). The pair in ranking 2 corresponding to 23 in ranking 1 is 41. 
This is in the reverse order of the recorded pair, so we write a minus below the 
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recorded pair (23) in the row of signs corresponding to ranking 2. And so on. 
This takes rather a long time to explain but the process is really very simple. 
The array corresponding to expressions (3) is then 

Recorded order: (12) (13) (14) (15) (16.) (23) (24) (26) (26) (34) (35) (36) (46) (46) (56) 
Ranking 1: + + + + + -'— + + - + + + + - 

Ranking 2: — + — d-d- — — + + -|-4--f-4-d-d- 

( 6 ) 

Now in expression (5) there are eleven oases in which both rankings have the 
same sign and therefore 15 — 11 =4 in which they have the opposite sign. The 
coefficient r is then given by 

_ H-4 _ 7 
Tl2_ 15 — 15’ 


Generally, if there are, in two rankings of n arrayed as above, cases of the same 
sign and iS 2 of opposite sign 

Tr ^(S 1 -S 2 ) 

n(n — 1) 


, 4 S t 

n(n-\) n{n — \y 


( 6 ) 


If we arrange ranking 1 in the natural order 1, ..., n, then every case in which 
there are the same signs in expression (5) corresponds to a case in which pairs in 
ranking 2 are in the natural order; and every case of different sign to one in which 
the pairs in ranking 2 are in the reverse of the natural order. The definition of (6) 
thus accords with the one originally given in my 1938 paper. It is often con¬ 
venient to take one ranking to be the natural order 1, ,.,,aso that the first row 
of signs in (5) are all positive. For instance, on rearrangement of the rankings in 
(3) we have 

1: 1 2 3 4 5 6 

2: 3 1 4 2 6 5 (7) 

and the array of paired comparisons becomes 

Recorded order: (12) (13) (14) (15) (10) (23) (24) (25) (26) (34) (35) (36) (45) (46) (56) 
Ranking 1: + + + + + + + + + + + + + +• + 

Ranking 2: — + — + + + + + + — + + + 4" 

( 8 ) 


Here again 8 X = 11, = 4, r 12 = jg. 

6. Consider now three rankings, of which the first may be taken to be the 
natural order 1, for example, 



1 : 

2 : 

3: 


1 2 3 4 5 

3 14 2 6 

4 2 16 3 


6 

5 

5 


(9) 
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The corresponding array of pairs is 


Recorded order; (12) (13) (14) 

(15) 

(16) (23) (24) 

(25) (26) 

(34) 

(36) 

(36) 

(46) 

(46) 

(56) 

Ranking 1: 

+ + + 

+ 

+ + + 

+ + 

+ 

+ 

+ 

+ 

+ 

+ 

Ranking 2: 

- + - 

+ 

+ + 4- 

+ + 

— 

+ 

+ 

+■ 

+ 

— 

Ranking 3: 

— — + 

““ 

+ - + 

+ + 

+ 

+ 

+ 


“*" 

+ 

(101 

For the coefficients r we 

have 










T 12 

(as above) = 

7 

15’ 










9-6 

3 









r is 

15 

15’ 










7-8 

1 









r 28 

15 

15’ 








in the last case 8 t being as usual the number of cases in which pairs in rankings 
2 and 3 have the same sign. 

Consider now the fourfold table setting out the occurrences of + and - 
signs in the rows of expression (10) corresponding to rankings 2 and 3: 


Banking 2 



+ 

- 

Total 

+ 

6 

5 

11 

- 

3 

1 

4 

Total 

9 

6 

15 


( 11 ) 


Here, for example, there are six cases in which pairs of ranks have the same 
(positive) sign, five in which ranking 2 is negative while ranking 3 is positive, 
and so on. 

Generally, if for three rankings of n the table is of the form 


Ranking 2 



+ 

- 

Total 

+ 

• 

a 

6 

0 + 6 

- 

0 

d 

c+d 

Total 

a + c 

b+d 

N 


( 12 ) 





281 


M. G. Kendall 


I define the partial rank correlation of 2 and 3 on 1 as 

T __ ad —be 

231 ~ +V{(« + b) (c + d) ( a+c)(b + d)} 


(13) 



(14) 


where N = 



and x 2 is the ordinary mean square contingency for the four¬ 


fold table. 

In the particular case here considered the coefficient is 


6-15 

Ta31 ""7(4.11.9.6) 


-0-185, 


as compared with r 28 = —0-067. 

In the case when ranking 1 is not the natural order 1, ..., n the same prin¬ 
ciples apply, but in considering the communalities of 2 and 3 we count as con¬ 
tributing to the a-cell in (12) the pairs which are themselves of the same sign and 
are also of the same sign as the term in ranking 1; and so on. 

7. The partial r defined in equation (13) is a coefficient of association in 
the 2x2 table suggested by Yule (1912). When the attributes of the table are 
independent and only in this ease it is zero. It is 4-1 only if 6 and c are both zero 
(i.e. if the two rankings agree in all pairs and hence are identical) and - 1 only 
if a and d are both zero (i.e. if the two rankings disagree in all pairs and one is 
the reverse of the other). In this latter property of attaining unity only when 
diagonally opposite cells are both empty it differs from two other coefficients 
proposed by Yule.* 

Thus partial t as defined is a measure of the association of agreements of the 
rankings 2 and 3 when compared in pairs with ranking 1. From this viewpoint 
it will be seen that the use of the word ‘partial’ conforms to that in the theories 
of association and correlation. The partial associations of A and B in a third 
population containing G and y are those of AG and BO or of Ay and By. The 
partial association of ranks is that of pairs of agreements of 21 and 31. The partial 
correlation of 2 and 3 when 1 is constant is paralleled by the partial rank correla¬ 
tion of 2 and 3 when 1 is in the natural order 1, ..., n. If partial r is unity in 
absolute value the rankings coinoide or one is the reverse of the other, whether 
they agree with ranking 1 or not, so that the coefficient fulfils its proper function 
of measuring the relationship between 2 and 3 independently of the influence of 1. 


* Yule himself arrived at the coefficient by considering product-moments in a 2 x 2 table. 
Karl Pearson & Heron (1913) mistook Yule’s intention and thought the ooeffioient was proposed 
as an estimate of the correlation in a normal population whose frequencies were given by a double 
dichotomy in the 2x2 table. Their long memoir is mainly devoted to advocating the alternative 
merits of tetrachorie r. The two things, as Yule has emphasized, are quite different. 1 mention 
the point to make it clear that the Pearson-Heron criticisms of Yule’s coefficient, even if not mis- 
founded, do not affect the above work, since I use the coefficient purely as a measure of association 
in the fourfold table. 
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8. We may establish the remarkable result 

_ r 23~ T 12 T 13 

a3a <1 -t? 2 )*U—rfa)*' 

la fact, from expression (12) we see that 

(a + b) — (c + d) 

Tlz ~ N 

{a + c)~ (b + d) 


{a+d)~{b + c) 
r ‘ a ~ N 

Remembering that N ~ a+b + c + dwe have 
l-'rli=*-fi- 2 (a+b)(c+d), 


' (15) 


1 - r i3=(p( a + C ) ( b + d )> 

r 23 ~ T n T is =~^[{a + b + c+d){(a+d)~{b + c)} 

— {(d + b) — (c + c£)} {(a + c) — (b + d)Y\ 
4 

= ^( ad - 6c )- 


Equation (15) follows at once from the definition of partial r in equation (13). 

The appearance of the product-moment type of relation between total and 
partial correlations is surprising. There was no reason to expect that partial r, 
which is a pure function of disarrangements in rankings and is not expressible 
algebraically in terms of the ranks, should bear any analogy with the partial 
correlation of variates; but since it does so, we are evidently fortified in regarding 
partial rasa convenient measure of rank correlation. 


Example. Ten men are ranked according to (1) intelligence, (2) mathematical ability, 
(3) musical ability. The rankings are: 

1:123466789 10 

2:146627398 10 

3: 4136267 10 9 8 

It will be found that r n — 0'644, r 13 ~ 0-644, r a3 = 0-666. Thus mathematical and 
musical ability are positively correlated. The question is, can this correlation be attributed 
to the correlation’ of both with intelligence ? 


We find 


_ 0-666 — (0-644) s 
7231 1--(0-644) a " 


= 0-24. 


The conclusion, would be that although part of the total correlation is due to the correlation 
of both with intelligence, part of it is not. We cannot attribute the whole of the observed 
(total) correlation between mathematical and musical ability to the interference of common 
correlation with intelligence. 
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9. The same methods are immediately capable of extension to paired com¬ 
parisons. In fact the array of type (10) is the array of paired comparisons for all 
the possible pairs of rants and when there are no constraints of the ranking 
character (i.e. such that if A -» B and B~±0 then must A-rG) the coefficient r 
becomes a measure of agreement in paired comparisons (of, Kendall & Babington 
Smith, 1940). We could then construct measures of partial agreement by the 
same formulae in cases where it was suspected that there were mutual influences 
at work between three observers, as for instance if it were suspected that com¬ 
munity of preference between two children was due to community of both with 
one of the parents. 

10. In conclusion, it may perhaps be desirable to point out that although 
partial r is defined by equation (14) in terms of y 2 for a fourfold table, its signi¬ 
ficance cannot on that account be tested in the Type III y 2 distribution with one 
degree of freedom. I have not yet succeeded in finding expressions for the sampling 
distribution of partial r but it seems clear that the Type III distribution will not 
be reproduced in the ranking case, at least without some substantial modification, 
even when the rankings are independent; for there exist correlations between the 
signs given by any ranking in the recorded order. If, for example, (12) and (23) 
are positive so must be (13), whereas if (13) and (23) are positive (12) may be 
either positive or negative. The units in the fourfold table cannot therefore be 
regarded as allocated at random and the type III distribution will probably not 
hold. I hope to return to this subject on a later occasion. 
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INEQUALITIES FOR MULTIVARIATE FREQUENCY 
DISTRIBUTIONS 

By 0. E. V. LESER 


Given a frequency distribution y o fa single variate x, with arithmetic mean 
£0 = 0 and with standard deviation<r, Tchebycheff’s (1867) well-known inequality 
presents a lower limit for the ratio of the frequency of all values of x between 
-An and Acr to the total frequency, where A § 1. In the ease of the special class 
of frequency distributions for which y(x) + y( - x) is a non-increasing function of 
| x |, this inequality can be substantially improved to another limit which applies 
to all positive values of A, as already Gauss (1823) has proved. Various authors* 
have generalized these theorems by modifying the assumptions made with 
regard to the frequency distribution, by introducing moments of higher order 
than the second or by extending some of the results to bivariate functions, In 
the present analysis only moments of second order are used, but the results apply 
to frequency functions of any number of variates. 

Suppose y(x 1 , to be a frequency distribution of n variates, with total 
frequency equal to unity, arithmetic mean at the origin and with standard 
deviations oq, r n . Let P be the frequency of all combinations of x 1 ,...,x n 
for which , , 2 , , 2 

rH +-+hrH 
w WJ 


Write 


A n = 


n 


i °“ViA!+...+i/Ar or ° V1M+...+1W’ 

so that A® and <?l are the harmonic averages of A|,..., A| and <r \,.... respectively. 
We also write 

A . rr. 

(i = 1 


n 


A i <r. 


ft-T’ T * „ 

A<) ^0 


R=* 


v) + -" + fe)i‘ v, y{fe) + - + fe)i 


so that P is the frequency of all values of aq,..., x n for which R(\Jn g A 0 cr 0 . Further¬ 
more, we define A(R 0 ) as the average value of y for all those values of x v x n 
for which R has a fixed value R 0 , and therefore 


A(Pq) — 


... ydx 1 ..;dx n 
= J __ 

... dx 1 ...dx n 
Jn=R 0 1 51 


* E.g. K. Pearson (1919), B. H. Camp (1922), S. Narumi (1923), 0. P. Smith (1930), P. O. Berge 
(1937). 
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We can now state the following: 

Theorem. Assuming the frequency distribution to be such that A(B) is a 
non-increasing function of M for Rjfn § Arcr 0l we have one of the following three 
sets of inequalities, according to the value of k : 


I. 


kAI: 


(i) 


^o=l 

P> 0, 

(ii) 


1^A 0 

1 

p>l_A 
= A§ 

II. 




(i) 



ri”t 2 (i-*,)( A s " 

(ii) 


(»+s) KiX °s* 


(hi) 


A 0 

p> Jl 
= A 2 ’ 

III. 




(i) 




(ii) 1 


W»/» + 2\* . / 2 \*/» 

l n ) u = (w+2/ 


(hi) 


/ 2 \ Vn •, 

U+s) k -^ Sk 

p^i-4. 

/c 2 

(iv) 


Ki A 0 

p>1-—. 

- AS* 


Proof. We introduce the new system of co-ordinates 

aq = cos t x cos t 2 cos t 3 ... cos t n _ t , 

x 2 = /i 2 T 2 jRsin t x cos t 2 coat 3 ... 008 t n ~n 
x 3 ~ /i 3 r 3 B sin t 2 cos t 3 ... cos t n _ v 


so that 
where 


X n — /J'nTn P S ^ n t‘n—1’ 

3 /(aq, ...,x n )dx 1 ... dx n = z{R,t x> ...,tn-i)BA~HRdt x ... dt n _ x , 
z(B,t v = y(R,h, ...,t n -i)°oat 2 cos%... cos ?i—2 t n _ x . 
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We also write s = cr 0 fn, 

and we are going to use the following abbreviating symbol 

I f(H t t)dt= ... f(£,h . tn-i)dh...dt n _ v 

JT J li=oJ /i=—Iff 

Then f B n 'H(R,t)ARdt = 1, 

f f R n ~ l z(R,t)dRd,t = P, 

Jn^ojT 

and as A (JR) — const. z(R, t) dt, we have the condition that z(R, t ) dt 
J T JT 

increasing function of JR for R ^ks. Furthermore, let us write 

j z(A 0 s, tJj dt i 

JT 


G 


and later on 

Now start from the equation 
vs 


G 

u — —s n . 
n 


(jay + ... + /siy 


• 00 

P 03 1 

["( ** )*+... + ( *’ 1 Y] 

J « — CO * 

a'n^-co 1 

lVi 7 i! \An r J J 


which can he written 

(Vo) 2 (^ + - +Xa) “ 
or a 2 ~ +1 2 , 

A = P” S f R n -+ 1 z{JR,t)dRdt, / a = f" f R n+1 z(R,t)dRdt. 

Jli**0JT jR-AtsjT 

According to the value of ks , three cases have to be distinguished. 


(a) ks 
For R-& A 0 a 
Hence 


t % ~Uln 

-p) 


or PS1- (k 11 — Aj l ) u. 
J z(R,t)dt^G. 


f A„s 

i 

JR** 0 


R n+1 dR, 


''A 8 

and incidentally P> (? ” R n ~ 1 dM = 

J i?=0 


is a non- 


.., dx n 
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To obtain a limit for jT 2 , we make use of the equation 

r« r r [(A 0 8)’’+|a-p)] ,, ’‘ 

R n -h(R,t)dRdt = l-P = G\ R n ~'dR 

JR=A t sJT Ja=A,s 

or f R n ~ 1 z(R,t)dRdt 

J JJ „[ (Ao 8 )n+ « ( 1 _ P) ]''“J 2 ' 

|.[(A,s)»+g(l-P)]“ n p f 

= | ^ R^lG -J z{R,t)dt dR. 

As the integrands are nowhere negative and R nowhere smaller in the integral 
on the left than in the integral on the right, we can write 






and therefore 


f R n+1 z(R,t)dRdt 

J T 

r [(A aS r+|a-p)] ,,n r f 

g P w+1 0- z(R,t)dt \dR, 

Jn=A„s L Jt J 




.[(Aosy+ga-P)] 1 '" 


B“A „s 


R^-HR, 


iG L„ R m -Sniw®—■ 


n iln (n + 2) v? g [n{X%u + 1 ~ P)]<»+ 2 >M ) 


Pgl+A 




■ n + 2\«/(»+2) 


n 


^ 2 /(tM- 2 )^ 


r V, ~\Vn 

(6) \ 0 s<K8< (A 0 «) ?1 + ^(1-P)J or P<l-{K n -\$)u, A 0 <*. 

As before, g G f R n+1 dR and P g Ag w. 

Jb=o 

Furthermore, we have the equation 

f R n - 1 z(R,t)dRdt = l-P = oi kS R n ~ l dR + fl - P - ^ (k w - A{, 1 )s”l, 

J“ J RP-H{R,t)ARdt = R n ^Q-j r AR,t)dt^dR 

+ fl-P 


l_p_^( /f »_A»)s* , 
» J 
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and, as before, it follows that 

f" f B n + l z{R,t)dRdt>r z(B,t)dt]dR 

Jr^ksJt Jr=+s L Jr J 

L A 0 f " R"+HR + (ks)* fl - P - - (** - K l ) « n ] • 
J R-k# L 71 J 

Hence s 2 fe 0 f’ R»+'dR + [Ksf |~1 - P - - (* n - A' 1 ) a 1 -!, 

J R=0 L n J 


1 = K ‘ l+ “ u + k 2 [ l ~ P- (x n -Ag) w], 

(c) ks < A 0 s or A 9 Sk. 

In this case, I x need not be larger than 0, but 

4s(A oS ) 2 (i-P), 

s 2 £(A 0 s) 2 (l~P). 

Therefore, if A a S 1, P > 1 - ^, and if A 0 < 1, no significant limit exists. 

A 0 

For A 0 & k, the problem has thus been solved, but the case A 0 g x is more 
complicated; summarizing the results of (a) and (6), we have two alternative sets 
of inequalities. Write 


A(w) = A^m, ffu) = 1 - (x» - A") u, 



M u ) = 1 + Aj2t 

(n + 2 

\ n 

\ nIO 1 + 2 ) 

J w 2/(n+2)^ 



-L.I X 71 _ 

2 U 


+ \ A ° 

n + 2 J 

Then either 

PZfv 

P*f» 

PZA, 

or 

P*A, 

P£A, 

PZA- 


For different frequency distributions, G and therefore u may assume any 
non-negative value which is compatible with the condition P^ 1. We have to 
find the effective lower limit for P as a function of u and its minimum value 
which, as easily seen, is only larger than 0, if / 4 (0) > 0 ot, what is the same thing, 
x > 1. 

A> A> A are straight lines. As 


dfs 

du 



,-»»/(n+2> 



c. E. Y. Leseb 


289 


/ 3 has a minimum when 


and / 3 (mln ' ) = 1 + - 


n + 2 

( 2 ' 

,(n+2)/re 

1 

_2/ 2 ) 

2 /ft J 


n 

\n + 2, 

) 

A? 4 * 2 ' 

n\n + 2j 

1 A"+ 2 


U 2 \ 

2 /ft l 

%+2 

/ 2 ’ 

\ 2,n 1 , 

/ 2 i 

,2/n 2 

•\n+2f 

AT 

» * 

+ 2 

) A 2 " 1 

U+2j 

1 Af 


Indicating by f^ the co-ordinates of the point in which the curves f^ and fj 

intersect, we find that 


I n 

W + 2 

n+2/ 
2 \ 


Mv)”- 

f “ ~ (v/v) As ’ 


/•>> f 3 anc * ft have a point (u 23i ,f m ) in common in which f i is the tangent to/ 3 : 


_ n+2 1 n + 2 K n ~A% 

234 n K n + 2 ’ , ' 234 w K n+Z ■ 


It is also seen that the sign of both expressions — and -u 234 - u ( g niIn - ) is positive 

/ 2 \V» 

or negative according to whether A 0 ^ -- a-. 

\u -f*2/ 

We have therefore four possibilities. In the following diagrams, the heavy 


Case 1 


Case 2 
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line shows the effective lower limit for P as a function of u (the upper limit is 
always equal to 1, of course), and its minimum gives the general lower limit for P, 
In these four cases, the following results are obtained: 


(!)) 


"^234 s£ 

PU U. 

(2) • 

(3)J 

© 

VII 

§ 


Pfe/w. 

(4) 

© 

All 


pm o), 


and by substituting the proper values: 



In addition, we know that for A 0 Si a:, A 0 ^ 1: P g 1 — —. By rearranging these 

do 

inequalities, we can bring them into the form in which they were given in the 
theorem, which is therefore proved. 

It may be remarked that k depends on the ratios between any two of the 
quantities A x , ,..,A n , but it remains the same if A x ,..., A ?l change in the same 
proportion. 

Let us consider the sets of inequalities I, II and III of the theorem separately. 
The most important case in which set I is relevant occurs when nothing about the 
frequency distributions known except the averages and standard deviations of the 
variables, in which casewehavetoput k- 0. It provides a generalization of Toheby- 
cheff’s inequality which is obtained in the special ease to — 1 in which A 0 = A. 

If n — 2, the inequality refers to the frequency of all points lying inside the 
ellipse which has the axes A x op A 2 *]2 cr a , and it can be written in the following 


way: 





It is interesting to compare this limit with the one given by Berge (1937) for a 
rectangle which has its corners in the points with the co-ordinates (+ A<r x , ± A<t 2 ) : 


p > i i+V(i ~ 

“ A2 


where r is the correlation coefficient of the frequency distribution. If r = 0, 
Berge’s limit equals 1 — 2/A 2 and is therefore equal to the limit we get for the 
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ellipse with the axes Ac r v Atr 2 which is inscribed to the rectangle. On the other 
hand, if r = ± 1, Berge’s limit becomes equal to 1 — 1/A 2 , and we have to choose 
the circumscribed ellipse with the axes A Ai /2tr 1 , \*j2cr 2 to obtain the same limit. 
Hence, the limit given here is certainly not inferior to the limit given by Berge, 
if the two variates are independent, but it is not superior to this limit, if there is 
a perfect correlation. 

Set II is perhaps of more theoretical than practical interest. An intermediate 
value of k may, however, he realized, if there is sufficient information about the 
frequencies inside, but not outside, a certain w-dimensional interval. The in¬ 
equalities correspond to those given by Narumi (1923) for frequency distributions 
of one variate only, but having a more general meaning in so far as P may refer 
to multiples of other quantities besides the standard deviation. The first inequality 
seems the most interesting one of this set; it reduces to a special case of Narumi’s 
inequality if we put n = 1: 

and for n = 2 it may be written 




or 




1 


X\ ' x 2 (l/A| + l/A|)’ 

Set III generalizes Gauss’s inequalities. It reduces to the first two inequalities 
for those values of A x ,..., A m (if any) for which k = co. This is the case for all 
values of A 1; ...,A„, if y(hx v ...,hx n )+y(-h% v -hx n ) is a non-increasing 
function of | h [ for any fixed values of x u ..., x n \ especially if the function y 
decreases monotonically along every straight line radiating from the origin. 

The assumption k = oo will be made throughout the following analysis. 
Gauss’s formulae are obtained by substituting n = I: 

In the case n = 2, the inequalities are also greatly simplified: 

A 2 


A 0 S1:P£1 2A g, 


or 


A_ + _2:2: P> 

x\ + xr ~ 
JL + _L<2:P> 1 

A f + Al= “ 


1/Af+l/Ai’ 

I/l + 

4\A? A1 


*l)‘ 
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Returning to the case of an unspecified value of n, we shall often be interested 
in a limit for the frequency of all values of the standardized or original variables 
for whioh the distance from the origin is no more than a certain value, i.e. either 


Wrtj - n 


er£ 


or 


x\+... + xl<p\ 


In the first problem, we have 


Aj — ... •— A tt — A 0 — 


P 

Jn' 


and our inequalities can therefore be written 


P* S 2**(*+ n n ~ m ■ P i 


2 2 iln (n + 2f n ~ z H n :PS 1 - 


/ 2 yi n n 

\w + 2/ p 2 ' 


In the second problem 

AjO-j = ... = 

Hence, if 

pi g 2 */»(» + 2 

2 2 ' ,ft (n -f 2 )<«-sVn 


" Xnffn 'Jn' A ° >?-/.. + <)• 
a- 2 +...4-tr 2 . p> T np % "I** 

» ' L( ri + 2 )(< r !+---+o'l)J ’ 

of+••■+<£ . P> , _/ 2 \»* aj+...+(7« 

ra ' \n-f2/ p 2 


A.gain, the insertion of special values for rt will simplify the formulae. 

Finally, it may be at least of theoretical interest to consider the generalized 
Gauss limit, as obtained for k = oo, as a function of n for fixed values of A 0 , and 
to compare it with the generalized Tchebycheff limit which is obtained for k 511 
and is independent of n. For some particular values of A 0 and n, this is done in 
the following table: 


\ ^0 
w \ 

1 

2 

3 

1 

0-577 

Gauss limit 

0-889 

0-951 

2 

0-500 

0-875 

0-944 

3 

0-457 

0-864 

0-940 

4 

0-423 

0-856 

0-936 

Any value 

0 

Tchebycheff limit 
0-750 

0-889 
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Furthermore, since for / —j* ) 

lim ( 

l—■*■> CO \‘ 


n 


and since 


lim . 

71—>00 4- 2/ 

lim 1 - 

n— j-otL 


■ lim 


/ 2 \ 2 M 11 _ 

\% +• 2; Ag_ 


»->■ oo it ~f~ 2 
1- 


it is seen that with an increasing number of variates, 
ally its superiority over the Tchebycheff limit and 
two limits tends to vanish. 


= 0 , 

j_ 

the Gauss limit loses gradu- 
the difference between the 
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THE MODE AND MEDIAN OF A NEARLY NORMAL 
DISTRIBUTION WITH GIVEN CUMULANTS 

By J. B. S. HALDANE, F.R.8 . 


In the early years of biometry the mode and median of a distribution, and 
especially the latter, were regarded as being almost as important as the mean. 
Later Pearson and others developed the method of moments, and recently the 
cumulants, which are readily derived from the moments, have been widely used. 
Pearson (1895 and after) discussed the relation of the mean, mode and median 
in some of the skew frequency curves which he invented. He found empirically 
that for skew curves of Type III, namely, 


V = + e-t> xla , 

when p is positive, Mode - Mean = 3 (Median - Mean) approximately, By fitting 
for a series of integral values of p he found 


Median-Mode 
Mean-Mode 


0-6691 + 0'0094p-h 


Some later writers have taken the trisection as a general law. But as we shall 
see, it does not always hold, even approximately. So far as I know, general 
expressions for the mode and median in terms of the cumulants have not been 
given, nor have the conditions been stated under which Pearson’s rule holds 
approximately. 

Consider a variate Z, with distribution dj - F(X)dX, and cumulants 

K-y-m, ~ oA K s ,...,K r ,.... 

The algebra is simplified if we make the transformation * — (X — m)j(T , so 
that x has mean zero, and unit standard deviation, its cumulants being 0, 1 , 7 !, 
y 2 ,..., y r ,..., where y r _ 2 = K r K^ ir , Thus y r is the rth measure of the deviation from 
normality of the distribution of Z. Now y T may become infinite for all values 
of r, or for all even values of r, above a certain value. It may diminish indefinitely 
or remain small. In other cases it falls with r at first, but then increases without 
limit. Thus for Pearson’s Type III distribution, whose equation has been given 
above’ 

Kt* ( r -l)I(P+1). and y r = (r+1)! (p + l)-*. 

And in the case of any estimate of a statistic, such as the mean, variance, 
standard deviation, or skewness, based on a sample of n members, y r = ()(n~ ir ). 
We shall not discuss the convergence of the series obtained later. But it is worth 
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noting that expansions in Hermitian polynomials are often satisfactory asymp¬ 
totic expansions, even if they diverge. 

Let df = f(x) dx be the distribution of x. Then it is known that we may write, 
symbolically,* 


f{x) = exp 


, y±(±\\7±(±Y __ ,y r-2(~dV 1 
L 3!\da;/ 4 \\dxj "' + r\ \dx) 




JVW 


Vi 


Expanding in Hermitian polynomials HJx) = e-** 2 , we have 

ax r 




y 3 


+ 


10y|+y 4 . 
6 ! 


35yi y 2 +y 5] 


7! 


' r5 H,, 

In the special case where y r = 0(n~ ir ) we have 

w = jS +#•<*> 






To find the mode, we put = 0- Hence 


H 1 (x)- J jH t (x)+^H 6 (x) + ...^0. 

The terms needed to find the root of this equation with the smallest absolute 
value depend, of course, on the behaviour of the cumulants as r increases. If 
] y 2 1, | y a j, etc., are not substantially less than j y 1 j, but all are small, so that 
powers and products can be neglected, then: 


*—£+* 76 


y? 


8 48 576 


Where y f = 0(n~ ir ), we have 


Hence 




So in the first case the mode is 

y 5 


m + 


i-^T- 


48 


+ ••• + 


(-) r y 2 r - 

2 r r\ 


,+ -) 


= K 


2_ i | ( Y K 1r+\ , 

2/c, ' 8xf 48x» 2rrl/cj 


_*3 | ** 

1 c\ -- ' C» . .0. 


v 2 


(i) 


* For a comparison of this expansion with that of the Charlier Type A distribution, see Cramer 
(1937). The symbolic relation has also been used recently by Cornish & Fisher (1937). 
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If K r = 0(n), then the mode is 


, _ a I _ 

' *1 o, T e*-2 


5/c 3 k 4 i k| 


^+0(r“) 


2/c a 8<| I2 ac| 4/cf 

- 4 . tL _ , 4 . nr«-a\ 

/ * 1 2fi z 8/iJ 12/*1 4/ij 


nx 

The median is the value of x for which f(u) du = or, 

J — 00 


since 


the value for which 






So 


e ix> 




6 ^ ' 24 

If y r does not fall off systematically we have 

a:== _Ii + ^_-ZL + . + lrT.Y*r=L + 

6 40 336 2 r (2r+])d T 


When K r = 0(»), we find 

a: 3 y 4 a 
3 6 


+ ^1 + 


7? sy! 
8 


2 r (2r+ l)r! 

71^,71 7a , 5y x y a 35y 3 
4 / 6 


40 + _ 48 


432 


4- 0 ( 71 -*) = 0. 


Hence 




6 40 12 1 324 

So if powers and products can be neglected the median is 


w + 

» 


J_2i + rs_75. + .\ 
\ 6^40 m ) 


_ „ *3 , * 

- K i~^: + 


1 6*, 40*1 336k§ 


+ ... + 




2r+l 


2 r (2r+ l)r! 4 




And when k t - 0(n ), the median is 




k 3 k 4 


+ ; 


174 


6k 2 40k| 1 2k| 3244 


0(n - 2 ), 


= »’ + Jf«_ + JM. _ 0(^-2) 

™ •6^,^40/tJ 12/dj 324/^! * 


(2) 


(3) 


(4) 
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Thus the distance from the mean to the mode is approximately thrice that 
from the mean to the median if y„ y 4 , etc., are small quantities (i.e. /? 2 is nearly 3 , 
etc.) and if y a , y 5 , etc., are small compared with y 4 . This is often the case for 
nearly normal distributions. Thus for the best known of Type III distributions, 
that of y a for n degrees of freedom, /q = n, /q = 2 n, ,K r = 2 r ~\r~ l)!n. Hence 
from equation ( 2 ) the mode is at n~2 + 0(n~ 2 ). In fact it is n — 2 exactly. For 


2 32 

the median, equation (4) gives n-~ +—— 0(n~*). The fraction which Pearson 

2 & 

empirically estimated at 0-6691 + 0-0094yr' 1 is therefore—h——- + 0(p~ 2 ) 

3 40 5p 

Consider the distribution of the estimated mean, n~ x £x r , from a sample of n, 
taken from a distribution with arbitrary cumulants /q, at 2 , k 3 , etc. The cumulants 
of the distribution of the mean are /q, K 2 jn, K. i jn i , etc. Hence equations ( 2 ) and 
(4) hold, and the mean is k v the mode is 


*3 

2 nKc 


+ 


\8a 


5k 3 k 4 


8*2 


124 


-b 


4 


4 4, 


and the median is 


* 3 4- 

( K s _ 

*3*4 

6 n/q 

Uo*i 

12 4 


+■ 


j n~ 2 + 0(n- z ), 

y 


iM 

324/c| 


| n~ 2 + 0(n~ s ). 


The corresponding expressions in terms of the moments may be written down. 
The.skewness is not exactly ljn of the skewness of the original curve, but nearly 
so if n is large. 

Again consider the distribution of the estimated variance from a sample of 
n members from a normal distribution with unknown mean, and variance a 2 . 

The cumulants of the distribution of the estimated variance——^ [£% 2 - n~ x {£ a;,.) 2 ] 
are 

„ 2 8<r 6 48cr 8 384cr 10 

*i = uT, * 8 " (n— l) 2 ’ Kl ~(n- 1) 3 ’ ' Cs_ («-!)*’ ® °* 


Thus we are dealing with a slightly transformed y 2 distribution, and the mode is 
2/r2 f __ 3 ^ 

0-2 -, or --— , exactly. From equation (4) the median is 

n —1 n —1 


<T‘ 


32 


3(»-l) 405(%-l) 


,+ 0(n~ 3 ) 


, or <7 2 1 


J2_ 

3 n 


302 
405 n 2 


+ 0(n' 


“ 3 ) • 


If, however, the distribution sampled is not normal, but has a finite /c 3 , etc., 
then the first three cumulants, in terms of those of the original distribution, are 


2*1 *4 , 84 , 4(n —2)4 12x 2 x 4 x 6 

K *’ n-l + n and n{n-l) 3 + n{n-l) + n*’ 

Fisher (1928) gives expressions for the next three cumulants, but these become 
very complex, k( 2 6 ) having 21 terms. 
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The mode and median of a distribution 


It follows that the mean is x 2 , the mode 


and the median 


K 


K 


a - + 
T2/C, 

-[x + 


4*§ + 8* 2 a: 4 + y 6 ~[ 
2(2/c| + /C 4 ) J 

4xf+ 8 /c 2 k 4 + k 6 “| 

6(2at| + kJ J 


w -1 + 0(n~ z ), 
w -1 + 0(n~ 2 ). 


The distribution of the estimate of k 3 is symmetrical for a symmetrical dis¬ 
tribution. In general its mean is x 3 , its mode 


*3“ 1 


45/c, + 


-558xf + 216/C2AT S + 108 /c 4 a.' s — §K a K e + 27 k 2 k^ + *9 . ,q , 2 , 

2(6d| + 9k§ + 9/c 2 k 4 + x 8 ) J 


the median differing from the mean by ■$ of the value given above. 

Finally the mean estimate of at 4 for a sample from a normal distribution is 
36c 4 12a 4 

zero, its mode-— + 0(n -2 ), and its median ———h 0(n~ 2 ). The general 

expressions for the mode and median can readily be calculated from Fisher’s 
(1928) equations. 

We now pass to some cases where Pearson’s trisection rule does not hold, 
even approximately. 

Pearson’s Type I and Type IV curves are asymmetrical, and have one more 
adjustable parameter than Type III. Thus y 2 , i.e. /i 2 - 3, can vary independently 
of y lt i.e. In consequence the curves may be nearly symmetrica], but far from 
normal. This is so if they approximate to Type II or Type VII, respectively. In 
this case the even measures of divergence y 2r may be much larger than the odd 
measures y 2r+1 , which tend to zero with symmetry. Thus we cannot neglect 
higher cumulants, or products, as in equations 1-4. For Type IV and VII curves 
the higher moments are infinite, so no formulae of the given type are possible. 
For Types I and II a formula could be given, but direct calculation is clearly 
preferable. 

A simpler case is that of the scalene triangular distribution whose graph is 
obtained by joining (-6,0), /o, j, and (a, 0). That is to say 

2(6+*) 2 (a—x) . 

y = 6(0+6} tor “ 4 »- 55 +Jj for 

Here the mean is |(a — 6), the mode 0, and the median, if a > b, is 


a -V , [|«( a + 6 )]. 


or 


a~6 (a ~b) 2 (a-6) 3 
4 32a ^ 128a 2 + 


That is to say, the median is one-quarter, not one-third, of the distance from the 
mean to the mode, if a — 6 is small compared with a. The rth moment about zero 

is 2[a r+1 - (- 6) f+1 ] 

r(r + l)(a + 6) 



If h = 


a—b 
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6^[2fc(9 + 2P)] 
7(3 + Jb 2 )» 


etc. 


voa»- then r ‘ - J i^+FT ri ‘ -*■ t. - 

If fc is small, the odd y s are of order k, whilst the even ones approximate to those 
of the symmetrical distribution. Moreover y 3 = ~^y v so the formulae 1-4 
clearly do not hold. 

We see then that formulae (2) and (4) are quite useful in a special case which is 
important in sampling theory, but have no general validity. 


Summary 

Expressions are obtained by which the distances of the mean and median of 
a skew distribution can be calculated from its moments or oumulants. The series 
obtained may or may not converge. They give satisfactory results for Type III 
distributions, and for the distributions of the mean, variance, and other cumu- 
lants as estimated from samples. 
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TABLE OF PERCENTAGE POINTS OF THE 
^DISTRIBUTION 

Computed by MAXINE HERRINGTON 


The following table has been derived from Miss Thompson’s Tables of Percentage Points of the 
Incomplete Beta Junction {Biomettika, 32, 168-81), by taking 

for the case v x = I. t is the usual “Student” ratio based on an estimate of variance having v t 
degrees of freedom. If tp is the quantity tabled, P/100 is the chance that 1 1 1 ^ tp, i.e. represents 
the area in the two tails of the ^-distribution. The table includes certain levels for t not previously 
available and should be accurate to the five significant figures shown. 


\ 

\p 

V \ 

60 

25 

10 

6 

2-5 

1 

0-6 

1 

1-00000 

2-4142 

0-3138 

12-706 

26-452 

63-657 

127-32 

2 

0-81660 

1-6036 

2-9200 

4-3027 

6-2053 

9-9248 

14-089 

3 

0-76489 

1-4226 

2*3534 

3-1825 

4-1765 

5-8409 

7-4533 

4 

0-74070 

1-3444 

2-1318 

2-7764 

3-4954 

4-6041 

5-5976 

6 

0-72669 

1-3009 

2-0150 

2-6706 

3-1634 

4-0321 

4-7733 

6 

0-71766 

1-2733 

1-9432 

2-4469 

2-9687 

3-7074 

4-3168 

7 

0-71114 

1-2543 

1-8946 

2-3646 

2-8412 

3-4995 

4-0293 

8 

0-70039 

1-2403 

1-8595 

2-3060 

2-7615 

3-3654 

3-8325 

9 

0-70272 

1-2297 

1-8331 

2-2622 

2-6850 

3-2498 

3-6897 

10 

0-69981 

1-2213 

1-8125 

2-2281 

2-6338 

3-1693 

3-6814 

11 

0-69746 

1-2145 

1-7959 

2-2010 

2-5931 

3-1058 

3-4966 

12 

0-69648 

1-2089 

1-7823 

2-1788 

2-5600 

3-0645 

3-4284 

13 

0-69384 

1-2041 

1-7709 

2-1604 

2-5326 

3-0123 

3-3725 

14 

0-69242 

1-2001 

1-7613 

2-1448 

2-5096 

2-9768 

3-3257 

16 

0-69120 

1-1967 

1-7530 

2-1316 

2-4899 

2-9467 

3-2860 

16 

0-69013 

1-1937 

1-7459 

2-1199 

2-4729 

2-9208 

3-2526 

17 

0-68919 

1-1910 

1-7396 

2-1098 

2-4581 

2-8982 

3-2225 

18 

0-68837 

1-1887 

1-7341 

2-1009 

2-4450 

2-8784 

3-1906 

19 

0-68763 

1-1866 

1-7291 

2-0930 

2-4334 

2-8609 

3-1737 

20 

0-68096 

1-1848 

1-7247 

2-0860 

2-4231 

2-8453 

3-1634 

21 

0-68635 

1-1831 

1-7207 

2-0796 

2-4138 

2-8314 

3-1352 

22 

0-68580 

1-1816 

1-7171 

2-0739 

2-4055 

2-8188 

3-1188 

23 

0-68531 

1-1802 

1-7139 

2-0687 

2-3979 

2-8073 

3-1040 

24 

0-68486 

1-1789 

1-7109 

2-0639 

2-3910 

2-7969 

3-0905 

25 

0-68443 

1-1777 

1-7081 

2-0595 

2-3846 

2-7874 

3-0782 

26 

0-68406 

1-1766 

1-7050 

2-0655 

2-3788 

2-7787 

3-0669 

27 

0-68370 

1-1757 

1-7033 

2-0518 

2-3734 

2-7707 

3-0565 

28 

0-68335 

1-1748 

1-7011 

2-0484 

2-3685 

2-7633 

3-0469 

29 

0-68304 

1-1739 

1-6991 

2-0452 

2-3638 

2-7564 

3-0380 

30 

0-68276 

1-1731 

1-6973 

2-0423 

2-3596 

2-7500 

3-0298 

40 

0-68066 


1-6839 

2-0211 

2-3280 

2-7045 

2-9712 

60 

0-67862 

1-1616 

1-6707 

2-0003 

2-2991 

2-0603 

2-9146 

120 

0-67656 

1-1559 

1-6577 

1-9799 

2-2699 

2-6174 

2-8599 

oo 

0-67449 


1-0449 

1-9600 

2-2414 

2-5758 

2-8070 









THE PROBABILITY INTEGRAL OF THE RANGE IN SAMPLES 
OF n OBSERVATIONS FROM A NORMAL POPULATION 

I. FOREWORD AND TABLES 
By E. S. PEARSON 


1. Scope of the main table 

Denote by x%, x % , x n a random sample of n observations, arranged in ascending order of 
magnitude, drawn from a normal or Gaussian population having for probability law 

^ ) = V^U exp [ -i ^] (1) 

where fi and <r are respectively the mean and standard deviation of the population. The 
range, sometimes described as the spread, in the sample is x n - x ± and we shall write the ratio 
of the range to the population standard deviation as 


w = 


«»-Si 

cr 


( 2 ) 


No simple expression exists for the probability law f n (w) of w, but Table 1 below gives 
for specific values IF of w computed values of the probability integral 

rw 

P n( W) - f„(w)dw (3) 

J 0 

This expression is the chance that the range in a sample of n observations is less than a 
given multiple of the population standard deviation. The table has been calculated for 
samples with n lying between 2 and 20 and for intervals of 0-05 of W. The values of the 
integral are given to 4-decimal place accuracy; linear interpolation is adequate except in 
the neighbourhood of the two quartiles in eaoh column. The method of calculation is 
described below by Dr H. 0. Hartley in a. separate section. 


2. Auxiliary table for special uses (Table 2) 

When dealing with samples containing only a small number of observations the range or 
spread may often be usefully employed as a measure of dispersion in place either of ftip 
standard (root mean square) deviation or the mean deviation. For example, an estimate of 
the population standard deviation may be obtained by multiplying the range in a single 
sample or the mean range in a number of samples by the factors a„ shown in Table 2. The 
accuracy of this form of estimation of cr compared with that of other methods has been 
discussed by Davies & Pearson (1934). 

In other circumstances it may be useful to plot in serial order on a control chart the 
values of range obtained from successive samples, e.g. when dealing with the control of 
quality in mass production. For this purpose it is necessary to know certain standard prob¬ 
ability levels for w which will serve as control limits*. Twelve of these levels expressed as 
percentage points and obtained by interpolation in the main Table 1, are shown in Table 2. 
They replace approximate limits published a few years ago (Pearson, 1932). It will be found, 
however, that except in the case n — 3f, the discrepancies between the two tables are all 
small. As an example, the table shows that if samples of 7 observations are randomly drawn 
from a population with a standard deviation cr, then in the long run only 6 % of these should 
have a range greater than 4-17<x, while 98 % should satisfy the inequality 

1 -25(T < x„ — < 4’ 49cr 

Probability levels for samples'with »> 12 have not been included as the use of range for 
control purposes in larger samples is of doubtful value. 

* For a discussion of the use of range in problems of industrial quality control see Reports 
issued by the British Standards Institution (Pearson, 1935, pp. 89-90; Dudding & Jennett, 1942). 

t Correct values for the ease n —3 were given by McKay & Pearson (1933). 


20-3 



Table I. Probability integral of the range W in normal samples of size n 


\. n 
TT \ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

000 

0-0000 

O '0000 








O '05 

■0282 

•0007 

0-0000 







O '10 

•0584 

•0028 

•0001 







015 

•0845 

•0062 

•0004 

0-0000 






020 

•1125 

•0110 

•0010 

•0001 






0'25 

O '1403 

0-0171 

0-0020 

0-0002 






0-30 

•1680 

•0245 

•0034 

■0004 

0-0000 





0-35 

■1955 

•0332 

•0053 

•0008 

UIIIHB 





O '40 

•2227 

■0431 

•0079 

•0014 


IflfiTililiM 




0-45 

•2497 

•0543 

■0111 

•0022 

3 

•0001 




0 50 

0-2763 

0-0666 

0-0152 

0-0033 

0-0007 


0-0000 



0-55 

■3027 

•0800 

•0200 

•0048 

•0011 


•0001 



0-60 

•3286 

•0944 

•0257 

•0068 

Willi V A 


•0001 

0-0000 


0-65 

•3542 

•1099 

•0323 

•0092 






0-70 

•3794 

•1263 

•0398 

•0121 

SilMI 

•0011 

•0003 


0-0000 

0'75 

0-4041 

O '1436 

0-0483 

0-0157 

0-0050 

0-0016 

0-0005 

0-0002 

0-0001 

0 80 

•4284 

•1616 

-0578 

■0200 

IliailiTSKl 




•0001 

0-85 

•4522 

•1806 

•0682 

•0250 

iHiM’lil 

•0032 

•0011 


HiliTI 

0-90 

•4755 

•2000 

-0797 

•0309 

Aim 

■STilEl 

Mini ) (si 


fiffi. III 

0-95 

•4983 

•2201 

•0922 

•0375 

•0150 

wMjiJ 



•0003 

100 

0-5205 

0-2407 

0-1057 

0-0450 


0 0078 

0-0032 

0 0013 

0-0005 

105 

■5422 

•2618 

•1201 

•0535 

•0234 

•0101 

iHiiiF&fl 


■0008 

110 

•5633 

■2833 

■1355 

•0629 


■0129 

BiSiilvl 

•0026 


1 15 

■5839 

•3051 

■1517 

■0733 


•0163 

Aiii/iJ 

•0035 

B iitl 

1'20 

•6039 

•3272 

•1688 

•0847 

•0417 

•0203 

■0098 


Kjgl 

1'25 

0-6232 

0-3495 

0-1868 

0-0970 

0-0495 

o 

o 

6 

0-0125 

0-0062 

0 0030 

1-30 

•6420 

•3719 

•2054 

•1104 

•0583 

•0304 

•0157 



1-35 

■6002 

•3943 

•2248 

•1247 

•0680 

■0366 

•0195 

•0103 

•0054 

1 40 

■6778 

■4168 

■2448 

•1400 

•0787 

•0437 

•0240 

•0131 

Mb ‘kfl 

1-45 

•6948 

•4392 

-2654 

•1562 

•0904 


•0292 

■0164 


1-50 

0-7112 

0-4614 

0-2865 

0-1733 

0-1031 

0-0606 

0-0353 

0-0204 

0-0117 

155 

•7269 

•4835 

•3080 

•1913 

•1168 

•0705 

•0422 


■ ttf jl 

1-60 

•7421 

•5053 

•3299 

•2101 

•1316 


■0499 

•0304 

loti 

1-65 

•7567 

•5269 

•3521 

•2296 

•1473 


•0587 

•0366 

■ 313 

1-70 

■7707 

•5481 

•3745 

•2498 

•1639 

■1064 

•0684 

■0437 

•0277 

1-75 

0-7841 

05690 

0-3971 

0-2706 

0-1815 

0-1204 

0-0792 

00517 

0 0336 

1-80 

•7969 

■5894 

■4197 

•2920 

•2000 

•1355 

•0910 


'■‘ 71 

1-85 

•8092 

•6094 

•4423 

■3138 

•2193 

•1516 

•1039 


•0479 

1-90 

■8209 

•6290 

•4649 

■3361 

•2394 

•1086 

•1178 

■0818 

•0565 

1-95 

•8321 

•6480 

•4874 

•3587 

•2602 

•1867 

•1329 

•0940 

•0661 

200 

0-8427 

0 - C 665 

0-5096 

0-3816 

0-2816 

0-2056 

0-1489 

0-1072 

0-0768 

205 

•8528 

•6845 

•5317 

•4046 

•3035 

•2254 

•1661 

•1216 

•0886 

210 

■8624 

•7019 

•5534 

•4277 

•3260 

•2460 

•1842 

•1371 

•1015 

215 

•8716 

•7187 

•5748 

•4508 

•3489 

•2673 

•2033 

•1536 

•1156 

2-20 

•8802 

•7349 

■5957 

•4739 

•3720 

•2893 

■2232 

•1712 


2-25 

0-8884 

0-7505 

0-6163 

0-4969 

0-3956 

0-3118 

0-2440 

0-1899 

0-1470 

2-30 

•8961 

■7655 

•6363 

•5196 

•4190 

•3348 

•2656 

•2095 

•1645 

2-35 

•9034 

•7799 

•6558 

•5421 

■4427 

•3582 

•2878 


•1830 

2-40 

■9103 

•7937 

•6748 

•5643 

•4663 

•3820 

•3107 

•2514 

■2025 

2-45 

•9168 

•8069 

•6932 

•5861 

•4899 


•3341 

•2735 

B 

2-50 

0-9229 

0-8195 

0-7110 

0-6075 

0-5132 

0-4300 

0-3579 

0'2964 

0-2443 
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Table 1 (cont.). Probability integral of the range W in normal samples of size n 


\. n 
TF \ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

2'50 

0-9229 

0-8195 

0-7110 

0-6075 

0-5132 

0-4300 

0-3579 

0-2964 

0-2443 

2-55 

•9286 

•8315 

•7282 

•6283 

•5364 

•4541 

•3820 

•3198 

■2665 

260 

•9340 

•8429 

•7448 

•6487 

•5592 

■4782 

•4064 

■3437 

•2894 

2-65 

•9390 

■8537 

•7607 

•6685 

•5816 

•5022 

•4309 

•3680 

•3130 

2-70 

•9438 

•8040 

•7759 

■6877 

•6036 

■5259 

•4555 

•3927 

•3372 

2-75 

0-9482 

0-8737 

0-7906 

0-7063 

0-6252 

0-5494 

0-4801 

0-4175 

0-3617 

2-80 

■9623 

•8828 

•8045 

■7242 

■6461 

•5725 

•5044 

•4425 

•3867 

2-85 

•9561 

•8915 

•8177 

•7415 

•6666 

•5952 

•5286 

•4675 

•4119 

2-90 

•9597 

•8996 

•8304 

•7580 

•6863 

•6174 

•5526 

•4923 

•4372 

2-95 

•9630 

•9073 

•8424 

■7739 

•7055 

•6390 

•5760 

•5171 

•4625 

3 00 

0-9661 

0-9145 

0-8537 

0-7891 

0-7239 

0-6601 

0.6991 

0-5415 

0-4878 

305 

•9690 

•9212 

■8645 

•8036 

■7416 

•6806 

•6216 

•5656 

•5129 

310 

•9716 

•9275 

•8746 

•8174 

•7587 

•7003 

•6430 

•5892 

•5378 

3-15 

•9741 

•9334 

■8842 

•8305 

■7750 

•7194 

•6649 

•6124 

•5623 

3-20 

•9763 

•9388 

•8931 

•8429 

•7906 

•7377 

•6856 

•6350 

•5864 

3-25 

0-9784 

0-9439 

0-9016 

0-8546 

0-8053 

0’7553 

0-7055 

Q -6569 

0-6099 

3-30 

•9804 

•9487 

•9095 

•8657 

•8194 

•7721 

•7248 

•6782 

-6329 

3-35 

•9822 

•9531 

•9168 

•8761 

•8327 

•7881 

•7432 

•6988 

•6553 

3-40 

-9838 

•9572 

■9237 

•8859 

■8454 

•8034 

•7609 

•7186 

•6769 

3-45 

•9853 

■9609 

•9302 

•8951 

•8573 

•8179 

•7778 

•7376 

•6978 

3-50 

0-9867 

0-9644 

0 0361 

0-9037 

0-8685 

0-8316 

0-7939 

0-7558 

0-7180 

3-55 

•9879 

•9677 

•9417 

•9117 

•8790 

■8446 

•8091 

•7732 

•7373 

3-60 

•9891 

•9706 

•9468 

•9192 

•8889 

■8668 

•8236 

•7898 

•7558 

3-65 

■9901 

•9734 

■9516 

•9261 

•8981 

■8683 

•8372 

■8055 

•7735 

3-70 

•9911 

•9759 

•9559 

•9326 

•9067 

•8790 

•8501 

•8204 

•7902 

3-75 

0-9920 

0-9782 

0-9600 

0-9386 

09148 

0-8891 

0-8622 

0-8345 

0-8062 

3-80 

•9928 

•9803 

•9637 

-9441 

•9222 

•8985 

■8736 

•8477 

-8212 

3-85 

■9935 

•9822 

-9672 

•9493 

•9291 

•9073 

•8842 

•8602 

-8355 

3-90 

-9942 

•9839 

•9703 

•9540 

•9355 

■9165 

■8941 

•8718 

•8488 

3-95 

•9948 

•9856 

•9732 

■9683 

•9415 

•9230 

•9034 

•8827 

-8614 

400 

0-9953 

0-9870 

0-9768 

0-9623 

0-9469 

0-9300 

0-9120 

0-8929 

0-8731 

4-05 

•9958 

■9883 

•9782 

•9660 

•9519 

•9366 

•9199 

•9024 

•8841 

4-10 

■9063 

•9805 

•9804 

•9693 

•9566 

■9425 

•9273 

■9112 

•8943 

415 

•9967 

■9906 

•9824 

•9724 

•9608 

•9480 

•9341 

•9193 

•9038 

4-20 

•9970 

•9916 

•9842 

•9752 

•9647 

•9530 

-9404 

•9269 

•9126 

4-25 

0-9974 

0-9925 

0-9859 

0-9777 

0-9682 

0’9576 

0-9461 

0-9338 

0-9208 

4-30 

•9976 

•9933 

■9874 

•9800 

•9715 

•9619 

•9514 

•9402 

•9283 

4-35 

•9979 

•9941 

•9887 

•9821 

■9744 

•9657 

•9562 

•9460 

•9352 

4-40 

•9981 

•9947 

•9899 

■9840 

•9771 

•9692 

•9607 

•9514 

•9416 

4-45 

•9984 

•9953 

•9910 

•9857 

•9795 

•9724 

■9647 

•9563 

•9474 

4-50 

0-9985 

0-9958 

0-9920 

0-9873 

0-9817 

0-9764 

0-9684 

0-9608 

0-9527 

4-55 

■9987 

•9963 

•9929 

•9887 

•9837 

•9780 

•9717 

•9649 

•9575 

4-60 

•9989 

•9967 

•9937 

•9899 

■9855 

•9804 

•9747 

•9686 

•9620 

4-65 

•9990 

•9971 

•9944 

■9911 

•9871 

•9825 

•9775 

•9719 

•9660 

4-70 

•9991 

•9974 

•9961 

•9921 

•9885 

•9845 

•9799 

■9750 

•9696 

4-75 

09992 

0-9977 

0-9956 

0-9930 

0-9898 

0-9862 

0-9822 

0-9777 

0-9729 

4-80 

•9993 

•9980 

■9962 

■9938 

■9910 

•9878 

•9842 

•9802 

■9759 

4-85 

•9994 

■9983 

■9966 

•9945 

•9920 

•9892 

■9800 

•9824 

•9786 

4-90 

•9995 

•9985 

•9970 

•9952 

•9930 

•9904 

•9876 

•9844 

•9810 

4-95 

•9996 

•9987 

•9974 

•9958 

•9938 

•9916 

•9890 

•9862 

•9832 

500 

0-9996 

0-9988 

0-9977 

0-9963 

0-9946 

0-9926 

0-9903 

0-9878 

0-9851 
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Table 1 (cont.). Probability integral of the range W in normal samples of size n 


\ n 

w\ 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

2-50 

0-2007 

0-1644 

0-1342 

0-1094 

0-0890 

0-0722 

0’0686 

0-0474 

0-0383 

0-0309 

2-55 

•2213 

• 1 S 33 

•1514 

•1247 

•1026 

•0842 

•0690 

•0566 

•0462 

•0377 

260 

•2429 

•2033 

•1697 

■1413 

•1174 

■0974 

•0807 

•0668 

•0552 

•0465 

2-65 

• 26 G 3 

•2243 

•1891 

•1591 

•1336 

■1120 

•0937 

•0783 

•0654 

•0545 

2'70 

■2885 

•2462 

•2096 

•1780 

•1609 

•1278 

•1080 

•0911 

■0768 

•0647 

2-75 

0-3124 

0-2690 

0-2311 

0-1981 

0-1696 

0-1449 

0-1236 

O '1053 

0-0896 

0-0761 

2-80 

•3368 

■2926 

•2536 

•2194 

•1894 

•1632 

•1405 

•1208 

•1037 

•0889 

2-85 

•3617 

•3169 

•2770 

•2416 

•2103 

•1829 

•1587 

•1376 

•1192 

•1031 

2-90 

■3870 

•3417 

•3011 

•2647 

•2324 

•2036 

•1782 

•1668 

•1360 

■1186 

2-95 

•4126 

•3670 

•3268 

■2887 

•2554 

•2265 

•1989 

•1752 

•1642 

•1355 

300 

0-4382 

0-3927 

0-3512 

0-3134 

0-2792 

0-2484 

0-2207 

0-1959 

0-1737 

0-1538 

305 

•4639 

•4186 

•3769 

■3387 

•3039 

•2723 

•2436 

•2178 

•1944 

•1734 

310 

•4896 

•4446 

•4029 

•3645 

•3292 

•2970 

•2675 

•2407 

■2164 

■1943 

315 

■5150 

•4706 

■4292 

•3907 

•3551 

■3224 

•2923 

•2647 

•2394 

•2164 

3-20 

•5401 

•4965 

•4555 

•4171 

•3814 

•3483 

•3177 

•2895 

•2635 

•2396 

3-25 

0-5649 

0-6222 

0-4817 

0-4437 

0-4081 

0-3748 

0-3438 

0-3151 

0-2885 

0-2638 

3-30 

•5893 

•5475 

•6078 

•4703 

•4348 

•4016 

•3704 

•3413 

•3142 

•2890 

3-35 

•6131 

•6726 

•6337 

■4967 

•4617 

•4286 

•3974 

■3681 

•3407 

•3150 

3-40 

•6303 

■5970 

•6692 

•5230 

■4885 

•4557 

•4246 

•3953 

•3677 

•3417 

345 

■6589 

•6209 

•5842 

•6489 

•5151 

•4827 

•4519 

•4227 

•3950 

■3689 

3-50 

0-6807 

0-6442 

0-6087 

O '6744 

0-5413 

0-6096 

0-4792 

0-4502 

0-4226 

0-3964 

3-55 

•7017 

•6608 

•0326 

•5994 

•5672 

•5362 

•5063 

•4777 

•4504 

•4242 

3-60 

•7220 

•6886 

■6558 

•6237 

•5926 

•5624 

•5332 

•5051 

•4781 

■4522 

365 

•7414 

•7096 

•6782 

■6474 

•0173 

•5881 

•5596 

•5321 

•5056 

•4801 

3-70 

■7600 

•7298 

•6998 

■6704 

•6414 

•6132 

•5856 

•5588 

•5329 

•5078 

3-75 

0-7776 

0-7491 

0-7206 

0'6925 

0-6648 

0-6376 

0-6110 

0-5850 

0-5598 

0-5352 

3-80 

•7944 

•7676 

•7406 

■7138 

■6873 

•6613 

■6357 

•6106 

•5861 

•5022 

3-85 

•8103 

■7860 

•7696 

•7342 

•7090 

•6841 

•6596 

•6355 

•6118 

•6887 

3-90 

■8254 

•8016 

•7777 

■7637 

•7298 

-7061 

■6827 

•6596 

■6369 

•6145 

3-95 

•8395 

•8173 

•7948 

•7723 

•7497 

■7273 

•7050 

■6829 

•6611 

•6397 

400 

0-8528 

0-8321 

0-8111 

0-7899 

0-7686 

0-7474 

0-7263 

0-7053 

0-6845 

0'6640 

405 

•8653 

•8460 

•8264 

•8065 

•7866 

•7666 

•7466 

■7268 

•7070 

■6874 

410 

•8769 

•8590 

•8408 

•8223 

•8036 

•7848 

•7660 

•7472 

•7285 

•7099 

415 

•8878 

•8712 

•8543 

■8371 

•8196 

■8021 

•7844 

•7667 

•7491 

•7315 

4-20 

•8978 

•8826 

•8669 

•8509 

•8347 

•8183 

•8018 

•7852 

•7086 

•7520 

4-25 

0-9072 

0-8931 

0'8787 

0-8639 

0-8488 

0-8336 

0-8182 

0-8027 

0-7871 

0-7715 

4 30 

•9159 

•9029 

•8896 

•8760 

•8620 

•8479 

■8336 

•8191 

•8046 

•7899 

4-35 

■9238 

•9120 

•8998 

•8872 

•8744 

■8613 

■8480 

•8345 

•8210 

•8073 

440 

•9312 

•9204 

•9092 

•8976 

•8858 

■8737 

■8614 

•8490 

•8364 

•8237 

4-45 

•9379 

•9281 

•9178 

•9073 

•8964 

•8863 

•8740 

■8626 

•8508 

•8391 

4 50 

0-9441 

0-9352 

0-9258 

0-9162 

0-9062 

0’8960 

0-8866 

0-8750 

0-8643 

0-8534 

4-55 

•9498 

•9417 

■9332 

•9244 

•9153 

•9060 

•8964 

•8867 

•8768 

•8667 

4-60 

•9650 

•9476 

•9399 

•9319 

•9236 

■9151 

•9064 

■8975 

•8884 

•8791 

4-65 

•9597 

•9530 

•9460 

•9388 

•9313 

•9236 

•9156 

•9074 

•8991 

•8906 

4-70 

•9640 

•9579 

•9516 

•9451 

•9383 

•9312 

•9240 

•9165 

•9090 

■9012 

4-75 

0-9678 

09624 

0-9567 

0-9508 

0-9446 

0-9383 

0’9317 

0-9249 

0-9180 

0-9110 

4-80 

•9713 

■9665 

•9614 

•9500 

•9505 

■9447 

•9387 

■9326 

•9264 

•9199 

4-85 

•9745 

•9702 

•9656 

•9608 

•9558 

•9605 

■9452 

•9396 

■9340 

•9281 

490 

•9774 

•9735 

•9694 

•9650 

■9605 

■9559 

•9510 

•9460 

•9409 

■9356 

495 

•9799 

■9765 

•9728 

•9689 

•9649 

■9607 

•9663 

•9618 

•9472 

•9424 

500 

0 9822 

0-9791 

0-9759 

0-9724 

0-9688 

0-9650 

0-9611 

0-9571 

0-9529 

0-9486 
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Table 1 (cont.). Probability integral of the range W in normal samples of size n 


\ n 

w \ 

2 




6 

7 

8 

9 

10 

500 

0-9996 




0-9946 

0-9926 

0-9903 

0-9878 

0’9851 

505 

•9996 

•9990 

•9980 

•9967 

•9952 

•9935 

•9916 

•0893 

•9869 

510 

•9997 

•9991 

• 9982 

•9971 

•9958 

•9942 

•9925 

•9906 

•9884 

515 

•9997 

•9992 

•9985 

•9975 

■9963 

•9960 

•9934 

•9917 

•9898 

520 

•9998 

■9093 

•9986 

•9978 

•9968 

•9956 

•9942 

•9927 

•9911 

5-25 

0-9998 

0'9994 

0-9988 

0-9981 

0-9972 

0-9961 

0-9949 

0-9936 

0-9922 

5-30 

•9998 

•9995 

•9990 

•9983 

■9975 

•9966 

•9956 

•9944 

■9931 

5-35 

•9998 

•9996 

•9991 

•9985 

•9979 

•9971 

•9961 

•9951 

•9940 

5-40 

•9999 

•9996 

•9992 

•9987 

•9981 

•9974 

•9966 

•9957 

•9948 

5-45 

•9999 

•9997 

•9993 

•9989 

•9984 

•9978 

•9971 

■9963 

•9954 

5-50 

0-9999 

0-9997 

0-9994 

0-9991 

0-9986 

0-9981 

0-9975 

0-9968 

0-9960 

5-55 

•9999 

•9997 

•9996 

•9992 

■9988 

•9983 

•9978 

•9972 

•9965 

5-60 

-9999 

•9998 

•9996 

•9993 

■9989 

•9985 

•9981 

•9976 

•9970 

5-65 

-9999 

•9998 

•9996 

•9994 

■9991 

•9987 

•9983 

•9979 

•9974 

5-70 

0-9999 

•9998 

•9997 

•9995 

•9992 

•9989 

■9986 

•9982 

•9977 

5-75 

1-0000 

0-9999 

0-9997 

0-9995 

0-9993 

0-9991 

0-9988 

0-9984 

0-9981 

5-80 


•9990 

•9998 

•9996 

■9994 

•9992 

•9989 

•9986 

•9983 

5-85 


■9999 

•9998 

•9997 

•9995 

•9993 

•9991 

•9988 

■9986 

5-00 


•9999 

■9998 

•9997 

•9996 

•9994 

■9992 

■9990 

•9988 

5-95 


•9999 

■9998 

•9998 

•9996 

■9996 

■9993 

•9991 

•9989 

600 


0-9999 

0-9999 

0-9998 

0-9997 

0-9996 

0-9994 

0-9993 

0-9991 

6-05 


•9999 

•9999 

•9998 

•9997 

•9996 

•9995 

■9994 

•9992 

610 


0'9999 

■9999 

•9998 

■9998 

•9997 

•9996 

•9995 

•9993 

6-15 


1-0000 

■9999 

•9999 

■9998 

■9997 

•9996 

■9995 

■9994 

6-20 



•9999 

•9999 

■9998 

•9998 

■9997 

•9996 

•9995 

6-25 



0-9999 

0-9999 

0-9999 

0-9998 

0-9997 

0-9997 

0-9996 

6-30 



0'9999 

•9999 

•9999 

■9998 

•9998 

■9997 

•9996 

6-35 



1-0000 

•9999 

•9999 

•9999 

■9998 

-9998 

■9997 

ft -40 




09999 

■9999 

•9999 

■9998 

■9998 

•9997 

6-45 




1-0000 

•9999 

•9999 

•9999 

•9998 

•9998 

6 50 





09999 

0-9999 

0-9999 

0-9999 

0-9998 

6-55 





0-9999 

•9999 

•9999 

•9999 

•9998 

6 60 





1-0000 

•9999 

•9999 

•9999 

-9999 

6-65 






0-9999 

•9999 

•9999 

•9999 

6 70 






1-0000 

0-9999 

•9999 

•9999 

6-75 







1-0000 

0-9999 

0 9999 

6-80 








0-9999 

•9999 

6-85 








1-0000 

■9999 

6-90 









0-9999 

6-95 









1-0000 

7-00 










705 










710 










7-15 










7-20 










7-25 
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Table 1 ( eont .). Probability integral of the range W in normal samples of size n 


\ n 

W \ 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

500 

0-9822 

0-9791 

0 - 97 S 9 

0-9724 

0-9688 

0-9650 

0-9611 

0-9571 

0-9529 

0-9486 

5-05 

•9843 

•9815 

•9786 

•9756 

•9723 

•9690 

■9655 

•9618 

•9681 

•9543 

5-10 

•9861 

•9837 

•9811 

•9784 

•9755 

•9725 

•9694 

•9661 

•0628 

•9593 

5-15 

■9878 

•9866 

•9833 

•9809 

•9783 

•9757 

•9729 

•9700 

•9670 

•9639 

520 

•9893 

•9874 

■9853 

■9832 

•9809 

•9785 

•9760 

•9736 

•9708 

•9681 

5-25 

0-9906 

0-9889 

0-9871 

0-9852 

0-9832 

0-9811 

0-9789 

0-9766 

0-9742 

0-9718 

5-30 

•9917 

•9903 

•9887 

•9870 

•9852 

■9833 

•9814 

•9794 

•9773 

•9761 

5-35 

•9928 

•9915 

•9901 

•9886 

•9870 

•9854 

•9836 

•9819 

•9800 

•9781 

5-40 

•9937 

•9925 

•9913 

•9900 

•9886 

•9872 

•9856 

•9841 

•9824 

•9807 

5-45 

■9945 

•9935 

•9924 

•9912 

•9900 

•9888 

•9874 

•9860 

•9846 

•9831 

5-50 

0-9952 

0-9943 

0-9934 

0-9924 

0-9913 

0-9902 

0’9890 

0-9878 

0-9865 

0-9852 

5-55 

•9958 

•9951 

•9942 

■9933 

•9924 

•9914 

•9904 

•9893 

•9882 

•9870 

5-60 

•9964 

•9957 

•9950 

■9942 

■9934 

•9925 

•9916 

•9907 

•9897 

•9887 

5-65 

•9969 

•9963 

•9956 

•9950 

■9943 

■9936 

•9927 

•9919 

•9910 

■9901 

5-70 

•9973 

•9968 

•9962 

■9956 

•9960 

■9944 

■9937 

•9929 

•9922 

• 9914 

5-75 

0-9976 

0-9972 

0-9967 

0-9962 

0-9957 

0-9951 

0-9945 

0-9939 

0-9932 

0-9925 

5-80 

•9980 

•9976 

•9972 

■9967 

•9963 

•9958 

•9952 

■9947 

•9941 

•9935 

5'85 

•9982 

•9979 

•9976 

■9972 

■9968 

•9963 

•9959 

■9964 

•9949 

•9944 

5-90 

•9985 

■9982 

•9979 

•9976 

•9972 

•9968 

■9964 

■9960 

•9956 

•9952 

595 

•9987 

•9985 

■9982 

■9979 

•9976 

•9973 

•9969 

•9966 

•9962 

•9958 

600 

0-9989 

0-9987 

0-9984 

0-9982 

0-9979 

0-9977 

0-9974 

0-9971 

0-9907 

0-9964 

605 

•9990 

•9989 

•9987 

■9984 

•9982 

•9980 

■9977 

•9975 

•9972 

•9969 

6-10 

-9992 

•9990 

•9989 

■9987 

•9985 

■9983 

•9981 

•9978 

•9976 

•9973 

615 

•9993 

■9992 

•9990 

■9989 

•9987 

•9986 

•9983 

•9981 

■9979 

■9977 

6-20 

■9994 

•9993 

•9992 

•9990 

•9989 

•9987 

■9986 

■9984 

•9982 

•9980 

6-25 

0-9995 

0-9994 

0-9993 

0-9992 

0-9991 

0-9989 

0-9988 

0-9986 

0-9985 

0’9983 

6-30 

■9996 

•9996 

■9994 

■9993 

•9992 

•9991 

■9990 

•9988 

■9987 

•9986 

635 

•9996 

•9996 

•9995 

•9994 

•9993 

•9992 

■9991 

•9990 

•9989 

•9988 

6-40 

■9997 

•9996 

•9996 

•9995 

•9994 

■9993 

•9992 

•9992 

•9991 

•9990 

6-45 

•9997 

•9997 

•9996 

•9996 

•9995 

•9994 

•9994 

•9993 

•9992 

■9991 

6-50 

0-9998 

0-9997 

0-9997 

0-9996 

0-9996 

0-9995 

0-9995 

0-9994 

0-9993 

0-9993 

6-55 

•9998 

•9998 

•9997 

■9997 

•9996 

■9996 

•9995 

•9995 

•9994 

•9994 

6-60 

•9998 

■9998 

•9998 

•9997 

•9997 

■9997 

•9996 

•9990 

•9995 

■9995 

6-65 

•9999 

■9998 

•9998 

•9998 

■9997 

•9997 

•9997 

■9996 

•9996 

•9995 

6-70 

•9999 

•9999 

•9998 

-9998 

•9998 

■9998 

•9997 

•9997 

■9997 

•9996 

6-75 

0-9999 

0-9999 

0-9999 

0-9999 

0-9998 

0-9998 

0-9998 

0-9997 

0-9997 

0-9997 

6-80 

•9999 

•9999 

•9999 

•9999 

■9998 

•9998 

•9998 

•9998 

•9908 

•9997 

6-85 

•9999 

•9999 

•9999 

•9999 

■9999 

•9999 

•9998 

•9998 

•9998 

•9998 

690 

0-9999 

•9999 

■9999 

■9999 

•9999 

•9999 

•9999 

• 99 b 8 

•9998 

•9998 

695 

1-0000 

0-9999 

•9999 

•9999 

•9999 

•9999 

•9999 

•9999 

■9999 

•9998 

700 
705 
7-10 
7-15 
7-20 

7-25 


1-0000 

0-9999 

0-9999 

1-0000 

0-9999 

0-9999 

1-0000 

0-9999 

•9999 

0-9999 

1-0000 

0-9909 

•9999 

0-9999 

1-0000 

0-9999 

•9999 

•9999 

0-9999 

1-0000 

0-9999 

•9999 

•9999 

0-9999 

1-0000 

0-9999 

•9999 

■9999 

•9999 

09999 

1 0000 

0-9999 

■9999 

•9999 

•9999 

0-9999 

1-0000 
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E. S. Pearson 


3. The origin of the ‘present tables 

Tables giving the expected or mean value and the standard deviation of range in random 
samples from the normal population of equation (1) were calculated by L. H. C. Tippett 
(1925) in the Department of Applied Statistics, University College, London. Since the 
probability distribution fjy>) is itself far from normal in form, it Was evident that its mean 
and standard deviation alone would not provide all the information generally needed in 
practice. Tippett included in his paper some values of the constants ^ and /? 2 of the dis¬ 
tribution and his-work was extended by the present writer (Pearson, 1926, 1932) who 

Table 2 


Size 

of 

sample 

n 

Factor 

«« 

Lower percentage points 

Upper percentage points 

0-1 

0-5 

1-0 

2-5 

5-0 

10-0 

10-0 

5-0 

2-6 

1-0 

0-5 

0-1 

2 

0-8862 

0-00 

001 

0-02 

0-04 


MW 

2-33 

2-77 

3-17 

3-64 

3-97 

4-66 

3 

0-5908 

0-00 

0-13 

0-19 

0-30 



WA\M 

3-31 

3-68 

4-12 

4-42 

5-06 

4 

0-4857 

0-20 

0-34 

0-43 

0-59 

0-76 

0-98 

3-24 

3-63 

3-98 

Kfol 

4-69 

5-31 

5 

0-4299 

0-37 

0-55 

0-66 

0-86 


1-26 

3-48 

3-86 

4-20 

4-60 

4-89 

5-48 

6 

0-3946 

0-54 

0-75 

0-87 

1-06 

1-25 

1-49 

3-66 


4-36 


5-03 

5-62 

7 

0-3098 

0-69 

0-92 

1-05 

1-25 

144 

1-68 

3-81 

4-17 

4-49 

4-88 

5-15 

5-73 

8 

0-3512 

0-83 

1-08 

1-20 

1-41 

mm* 

1-83 

,111 

4-29 

4-61 


5-26 

5-82 

9 

0-3367 

.0-98 

1-21 

1-34 

1-65 

1-74 

1-97 

4-04 

4-39 



5-34 

5-90 

mm 

0-3249 

1-08 

1-33 

1-47 

1-67 

1-86 

2-09 

4-13 

4-47 

4-79 

5-16 

5-42 

6-97 

a 

0-3152 

1-20 

1-45 

1-58 

1-78 

1-97 

2-20 

4-21 

4-55 

4-86 

5-23 

5-49 

isa 

12 

0-3069 

1-30 

1-55 

1-68 

1-88 

2-07 

2-30 

4-29 

4-62 

4-92 

5-29 

5-54 

H 


Estimate of cr=a„x range (or mean range) in a sample of n observations. 


developed an approximate method of determining probability levels for w and provided 
some provisional tables of those. The need has, however, been felt for some time for a full 
and accurate table of the probability integral of the range tofit into place among otherfunda- 
mental tables associated with the normal distribution. The completion of this objective has 
been made possible by a grant from the Department of Scientific and Industrial Research, 
whose assistance in the matter is acknowledged with warm appreciation. Tire actual method 
of computation was planned by Dr H. 0. Hartley and the calculations w r ere carried out under 
his supervision by Scientific Computing Service, Ltd. The scope of the main table was 
limited to n < 20. As n increases beyond this value there is an increasing risk that the table 
may be misleading in practice, since f„{w) becomes very sensitive to relatively slight 
departures from normality in the tails of the population distribution. 


REFERENCES 

Davies, 0. L. & Peabson, E. S. (1934). J.R. Statist. Soa. Supp. 1, 76. 

Dudding, B. P. & Jennett, W. J. (1942). The Application of Statistical Methods to 
Quality Control. British Standards Institution, No. 600, (1942). 

MoKay, A. T. & Peabson, E. S. (1933). Biometrika, 25, 415. 

Peabson, E. S. (1926). Biometrika, 18, 173. 

- (1932). Biometrika, 24, 404. 

- (1936). The Application of Statistical Methods to Industrial Standardization and 

Quality Control, British Standards Institution, No. 600, (1935). 

Tippett, L. H. C. (1925). Biometrika, 17, 364. 











309 


II. NUMERICAL EVALUATION OF THE PROBABILITY INTEGRAL 

By H. 0. HARTLEY 

The formula used for the tabulation of the probability integral P„( W) of the range in normal 
samples of size n is given in the paper printed on pp. 334-48 below, where it proved that 


a -f-iTJ 7 \n /*co [ pu \n-l 

^z(x)dxj + 2nJ ^z(u) I z(x) ix\ du, 


( 1 ) 


where 


z(x) = (2ir)~i e~l x *. 


Certain properties of this formula and the facilities provided by certain modem calculating 
machines make this integral amenable to tabulation. 

The main work consists in the evaluation of the integral 


J «> / ru \ n-1 

2(w)| z{x)dx\ du, 
IW \J u-W ] 


( 2 ) 


by quadrature for a two variable' network of values of n and W. The range of integration is 
from %-W up to a point where the integrand 


z{u) 


J u- 


z(x) dx 
w / 


r 


m 


vanishes to 7-decimal accuracy.* For each point of the network n, W, therefore, the integrand 
(3) was tabulated for a set of equidistant values of u covering the range of integration. The 
interval of integration was chosen as Au = 0-2 throughout. This was sufficient to obtain 
about 6-decimal accuracy in the integral (2). 

The interval in W was taken as wide as possible but sufficiently fine to permit cheeking 
by differencing and the subsequent subtabulation of P„(W) to interval 0'05, which is the 
interval in the final table. An interval of AW = 0-26 was therefore chosen for the n, W 
network. 

For small values of W it was necessary to tabulate the integrand for all integers n for 
which P n (W) is required in the final table. For larger values of n and W, however, it was 
sufficient to calculate the final integral (1) for odd n and to obtain intermediate values by 
interpolation. Below, then, is shown the two variable network for which the integral (2) 
was produced by quadrature: 

W ~ 0-00 (0-26) 1-26 and n = 3 (1) 20. 

W = 1-60 (0-26) 2-76 n. = 3(1) 9 (2) 23. ( 

IF = 3-00 (0-26) 3-26 n = 3 (1) 6 (2) 23. 

W = 3-60 (0-26) 8-00 n= 3 (2) 23. 

For n — 2 the final integral P 2 ( W) is given directly by the normal integral and may be 
obtained by interpolation in Table II of Tables for Statisticians and Biometricians, Part I. 
Using the notation of that table (Sheppard’s original notation) we have 


(4) 


AW 




Moreover, for purposes of interpolation, use was made of the formal relation 

P 1 (1F) = 1 for TF>0. 

For fixed u and W and for values of n in the arithmetic progression (4), the integrand (3) 
is a geometric progression with , \ 2 

z(w)IJ %{x)dx\ 

* The integrand was calculated to 7-decimal aoouraey in order to obtain P n {W) to about 
5-deoimal accuracy. 
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as leading term .and | f z(x) dal or ( f z{x)dx j 

\Ju~w 7 \J n—W ! 

as common ratio. This leading term as well as the common ratios were easily obtained from 
Table II of Tables for Statisticians and Biometricians, Part I and the terms of the progression 
were then automatically produced on a Mercedes calculating machine Model 38 M.S. and 
copied down in two-way tables with u as row heading, n as column heading and W as 
table heading. The values of the integrand were then checked by differencing column-wise 
and added to yield the main term of the integral (2). The correction terms which, according 
to Gregory’s formula, convert the integrand-sum into the integral were calculated from the 
differences and checked by the application of Gauss’ formula of integration. Finally, to 
obtain P n {W) the term 


was produced by continued multiplication on the Mercedes and added to the corresponding 
integral (2) to yield P„( if 7 ) for all points of the above network. 

For odd values of n the integral P„(W) was,then differenced IP-wise on the National 
machine which, incidentally, produced column totals SP n (tP) for these values of n. Two 

W 

ohecks were applied at this stage. One consisted in inspecting the fourth order differences. 
As a second check, the mean range, w „, was calculated from the formula 

= 8 - ^P n (w)dw* 

and compared with the correct mean range given in Table XXII of Tables for Statisticians 
and Biometrioians, Part II. Finally, the function P n { IF) was subtabulated to interval 0-05 
on the National machine by a method similar to that described in detail by L. J. Comrie 
(1936). 

The values of P n (IF) for even n were then obtained by interpolation with the help of 
two interpolation formulae of Lagrangian type: 

2048P„(IF) = -S[P„. 7 (lP) + P„ + ,(IP)] + 49[P„_ 5 (IP) + P n+6 (lP)] 

-245[P n _ 3 (F)+P B+s (TP)] + 1225[P n _ 1 (IP)+P n+1 (IP)], (5) 

20P„( IP) = P n _ 3 ( IP) + P n+3 ( IP) - 6[P„_ a ( IP) +P B+a ( IP)] 

+ 15[P„- 1 (IP)+P„ +l (H0]. (6) 

Formula (5) yields the interpolate for even n from the given values of P„( W) at adjacent odd 
values of n. This formula was used throughout. In some cases, however, the resulting inter¬ 
polate was accurate to about 3 places of decimals only. In such cases values of P„_ s , P n+3 , 
Pn-i> Pn+i accurate to 6 places of decimals and values of P„_ a , P M+2 accurate to (say) 
3 places of decimals were substituted in formula (6). This yielded a ‘corrected value’ of 
P n (lP). The process was then repeated for n = n+ 2 and so on until all values of P„(W) 
had ‘settled down ’ for even values of n. It is easy to see that the process is convergent and 
that the maximum error in the interpolate is 2 units for the 6th decimal. 

After completion of the interpolation n-wise, the interpolates P n (IF) for even n were 
differenced IP-wise, checked and subtabulated as for odd values of a. 

* This is true provided P n { 8) = 1 to 6-decimal place accuracy. 


a +W \n 

vH 


REFERENCE 

Comrie , L. J . ( 1936 ). J.B. Statist. Soc. Suppl. 3, 87-114. 



NOTES ON TESTING STATISTICAL HYPOTHESES 
By E. S. PEARSON 


1. In July 1939, a few weeks before the opening of the present war, a Con¬ 
ference on the Application of the Calculus of Probabilities was held at Geneva 
under the auspices of the International Institute of Intellectual Co-operation 
(League of Nations). At the public session at which a paper by Prof. J. Neyman 
was presented and also subsequently in some informal discussions, a number 
of questions were raised: 

(a) In choosing a test for a statistical hypothesis, is it possible or even necessary 
to specify the hypotheses alternative to that tested? Why should not a test be 
made to depend only on the form of law associated with the hypothesis tested? 
For example, Newton’s hypothesis of gravitation was formulated and tested 
without any need to define alternative laws. 

(b) Is the method of approach to these problems advocated by Prof. Neyman 
and myself applicable to testing the appropriateness of probability laws or only 
to testing hypotheses regarding the numerical values of constants contained in 
these laws? 

After the conclusion of the conference, I set down some Notes for a few of the 
statisticians who had taken part in the discussions, hoping that at leisure they 
might feel stimulated to define their views on the subject more precisely. But 
almost before the Notes were despatched, war in Europe had intervened. The 
only reply which I received was from Prof. Gumbel, and this, after some un¬ 
avoidable delay, has now taken shape in the contribution printed on pp. 317-33 
below. In publishing this, it seems useful to add my own Notes, which are given 
with only minor verbal alterations in the following pages. They are in part a 
restatement with rather different emphasis of views expressed in a paper published 
four years ago (Pearson, 1938). 

2. With regard to one of the points raised under (a) above, it should be 
remembered that a statistical hypothesis as defined by Neyman and myself is 
a hypothesis concerning the probability law of random variables. The gravi¬ 
tational hypothesis of Newton is not a statistical hypothesis in the sense defined; 
statistical methods may be introduced to test the Newtonian hypothesis, how¬ 
ever, and they will involve tests of statistical hypotheses or ‘significance tests’ 
because it will be assumed that errors of observation exist which may be regarded 
as random variables, probably taken to follow the normal distribution law. 

For example, on the Newtonian hypothesis, the angular co-ordinates of a 
planet measured from the earth as origin may at certain moments be given as 
£ = rj = rj (t = 1,2,...). If we have a number of observations of position y, 
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subject to observational error, the statistical problem will be to test whether 
these are consistent with the hypothetical position values or whether they 
suggest that £, rj have some other different values at the moments of observation. 
Thus the ‘alternatives’ that we have immediately in mind will be alternative 
values for £, t], not alternative gravitational hypotheses. If, however, some alter¬ 
native law of motion were proposed, so that we could specify definite values 
ij'i alternative to the values ty of the Newtonian law, then undoubtedly we 
could choose a statistical test which would be particularly efficient in discrimi¬ 
nating between the two alternatives. Such a course became possible when the 
Einstein hypothesis was formulated and the orbit of Mercury considered. But 
the absence of an alternative gravitational law does not prevent us selecting a 
statistical test which will be (a) sensitive to departures in g, rj from g j5 rj t , but 
(6) relatively insensitive to departures from normality in the distribution of 
errors. We should make this selection because, if the Newtonianlaw were incorrect, 
we believe that this would result in a change in g <( Tj ( but not in a departure of the 
distribution of observational errors from the normal law. 

This example, of course, concerns a statistical hypothesis regarding the values 
of two parameters g, rj, not regarding the form of a probability law of random 
variables. The following general approach shows, however, that the principles 
discussed may be applied to testing hypotheses regarding probability laws. 

3. Suppose that £ is a continuous random variable and that H 0 is a statistical 
hypothesis which assumes that the elementary probability law for x is p(x j H 0 ) 
in the interval — oo to -f oo. Thus 

f ${x\H 0 )dx = 1. (1) 

J — 00 

Now write y - j p(x | H 0 ) dx. (2) 

J —CO 

y will be a non-decreasing function of x having values confined to the interval 
(0,1). Further, the elementary probability law of y will be 

v{y) - * 1 for ( 3 ) 

or all values of y between 0 and 1 are ‘equally probable’. 

Suppose now that we wish to use a set of n independent values aq, x 2 , ..., x n 
to test that the probability law is of the assumed form p(x | H 0 ). It is clear that 
the hypothesis H 0 is exactly equivalent to the hypothesis, say h 0 , that the n 
values y lt y 2 , ...,y n (obtained from the afs by the transformation (2)) have been 
sampled subject to the probability law (3). Just as the point (aq, aq,..., x n ) may 
be represented in an unlimited %-dimensioned space having probability density 

p(aq,aq, ...,x n \ H 0 ) = n Maq | H 0 )}, (4) 
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if II Q is true, so the point {y Q , y t ,..., y n ) may be placed in an ^-dimensioned hyper¬ 
cube with sides of unit length and with uniform probability density, if H 0 and 
therefore h 0 is true. It follows that if H 0 is what has been termed a 1 simple hypo¬ 
thesisi.e. specifies the form of p(x \ H 0 ) completely,* then the test of IJ 0 may 
always be transformed to the test of h Q . If then it were correct to say that the test 
of a statistical hypothesis depends only on the form of the law specified by H 0 , it 
follows that for the type of situation considered the testing of a statistical hypo¬ 
thesis could always be reduced to the following simple problem: 

To test whether a sample of n independent random variables y lt y 2 , ..., y n 
(0 < < 1) has been selected from the so-calied rectangular distribution, i.e. the 

distribution for which p(y) = 1, (0 < y =§ 1). 

4. We are at once faced, therefore, with the question of how to test this 
simple but apparently fundamental hypothesis. If h a is true, the sample point 
is equally likely to fall at any point within the ^-dimensioned hypercube. Thus 
in picking out the critical (or rejection) region in this space we can get no assist¬ 
ance whatsoever from the changes in probability density, as we might do in 
the rc-space. If we wish to use a level of significance of a (say a = O-Oljfor rejecting 
h 0 , it is clear that an infinite number of critical regions satisfying this condition 
are available; it is only necessary to select a region whose content is a. 

If we consider the n values of y and plot them in the interval (0,1) as follows, 

® • ® • s o & 

1—i- 1 - 1 - 1 -r—“i- 1 -r— r — —i—I 

Kg. l. 

the great majority of samples, from a rectangular distribution, at any rate if n 
is not too small, will be spread out fairly uniformly throughout the interval. 
Perhaps an ‘ideal’ sample against which to measure irregularities might be 
described as one for which the values of y fell at 

l_ _3_ _6_ 2n-l 

2 n’ 2 n’ 2 n’ 2 n 

But what form of departure from this ideal of uniformity are we to pick out as 
suggesting that the hypothesis h 0 is disproved? Should we judge significance by 
paying attention to the value of the mean y, of the variance, of the range of 
variation or of higher moments? Or should we use the y a or w a tests? It seems 
difficult to find any basis for choice which could be regarded in any sense as the 
1 best ’. For any set of values y v y z , .-.,y n some critical region of size a can always 
be found which will contain the sample point and therefore lead to the rejection 
of h 0 . Indeed, the task of selecting a unique region on any rational basis would 
seem to be insoluble. 

* This condition is important. If the values of certain constants contained in the probability 
law need to be estimated from the observations, then the n values of y will not form a true random 
sample from a rectangular distribution. They will be subject to certain limitations to their degrees 
of freedom, though these may be relatively unimportant if a is large. 
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5. Directly it is recognized, however, that the choice of a test of a statistical 
hypothesis depends on something more than the form of the law associated with 
that hypothesis, it can be seen, how a solution may he obtained.. If we can specify 
a single alternative H 1 to H 0 or a class of alternatives C(H), then we shall have 
also an alternative h , or a class C(h) to h 0 . Thus, ifp{% | Hj) denotes a probability 
law alternative to p(x \ H 0 ), then for y the alternative is 

for ocyci, (s) 

where }{y) means the solution of 

V = \p{x\H^)dx ( 6 ) 

with regard to x. For example, Fig. 2 shows three typical forms of alternative 
p(yjh 1 ), p(y | A a ) and p(y j h 3 ) associated with alternatives p(x\H 1 ) > ..., etc., to 
p{x\E 0 ), 



Solid curve represents p(x\H 0 ) 



0 10 10 


Solid rectangle represents p(yjh 0 ) 

Fig. 2. 

We can now see the kind of test which will be most efficient for testing H 0 
with regard to possible classes of alternatives. If the alternative laws are of 
smaller dispersion (as p(x | H^), we must be on the look-out for too many values 
of y near \ and too few near 0 and 1. For alternatives with greater dispersion 
(as p(x | H 2 )), we must reject H 0 when there are too many y’s near. 0 or 1 and too 
few near While if the alternatives are likely to be asymmetrical curves (as 
P( x I ^a))> then a different rule will be needed, as suggested by the p(y | h z ) curve. 

6. It follows that in so far as it is possible to formulate the class of admissible 
probability laws p(x | H), the problem of selecting the most efficient test of H 0 
reduces to that of choosing a critical region in the Ti-dimensioned hypercube 
which is most effective in detecting, from a sample of n values of y, differences 
between the rectangle p(y | A 0 ) and the appropriate alternative forms p(y | h). 



E. S. Pearson 31 5 

If H t is a single admissible alternative, then it has been shown (Neyman & 
Pearson, 1933, p. 298) that the region w a of content a in the hypercube, within 
which n 

n p(y i i h) 

V-<*. (7) 

n pdfi \K) 

i=l 

or, in view of equation (3), II J»(ft|*i)>j, (8) 

where k is chosen so that P{{y 1: y 2 ,..., y n ) ew 0 j h 0 ] = a (9) 

has the following property. 

Of all regions of content a, w 0 is more likely than any other to include the 
sample point when h v and not h 0 , is true. The region has been termed the best 
critical region for testing h 0 with regard to the alternative h v 

As soon as II a and H x are specified, clearly p(y \ h x ) and therefore the region 
w 0 can be found, although mathematically it may be rather difficult to determine 

n 

the Appropriate boundary II p(y x I h ,) = constant, so as to satisfy (8), Since this 

•£=1 

product is the probability density in the hypercube given by h x , it will be seen 
that what we set out to do is to include in the critical region those parts of the 
sample space where the density for h x is highest. It is here, on repeated sampling, 
that sample points would tend to be concentrated if h x is true, instead of being 
uniformly distributed as under h 0 . 

7. If instead of a single alternative h lt there is a class of admissible alter¬ 
natives C(h), there may or may not be common points of concentration that can 
be included in the critical region. This will depend on whether the inequality 
(8) above defines a region independent of the particular hypothesis h of the class 
G(h). Even if there is no single region of content a which is exactly a ‘best. 
critical region’ for h Q with regard to all members of CQi), the general principle 
may still be used as a guide. We build up a critical region out of those parts of 
the hypercube where the probability density tends to he concentrated when the 
probability law departs from p(x | H 0 ) in the direction of the alternatives included 
in 0(H). 

Eor example, in my earlier paper (Pearson, 1938) I suggested as appropriate 
in the following situation a test which, while not based on a common best critical 
region, was selected so as to include regions of greatest density associated with 
alternatives of C(h). Eor the hypothesis tested, 

*(* m 

The alternatives are asymmetrical curves with the same mean and standard 
deviation as (10). A typical alternative would be the Type III curve 

A 1 M. 

p(x | H) = c(l + VA)A e vto, (II) 

n 


Biometrika xxxii 
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whose form departs more and more from (10) as increases from zero, but the 
class need not be defined as precisely as this. In this problem it appears that 
if n independent observations z v x 2 , ..., x n are available, the following is a good 
test of H q . Take as test function „ 


V 

q - ii m>, 

(12) 




where 

y'i — 5(0-2—y,) for 0« </*«<)• 2, 1 
— 0-2) for 0-2 0-8, 

y'i = 5(1 — i/i) for O-Scy^l, J 

(13) 


C x 1 


and 

H_^) e ^ 

(14) 


If H 0 is true it may be shown that — 2 log t Q is distributed as y 2 with 2 n 
degrees of freedom. Hence any desired significance level a, for Q, may be found. 
We should then reject H 0 when Q is significantly small. 

A more systematic method of dealing with such problems has been con- 
sidered by Neyman (1937) in his paper on ‘smooth tests’. 

8. To sum up, the position seems to be this. It has often been argued that, a 
statistical test need only depend on the form of the probability law associated 
with the hypothesis tested. In the case where H 0 concerns the probability law 
of a single random variable and where p(x | H 0 ) is precisely specified, by the trans¬ 
formation from x to y it has been shown that the problem of testing H 0 on the 
basis of n independent values of x can always be reduced to another problem, 
which involves this question. Can we regard a sample y v y 2 , y n as having 
been drawn from the rectangular distribution p(y | h 0 ) = 1, where We 

are faced with a single fundamental question and we have to consider whether 
it can be answered in a rational manner, unless we are prepared to take into account 
the kind of departures from the rectangular law that we either believe possible 
or at any rate consider it most important to be on the look out for. 

The transformation from x to y seems to have the advantage that it con¬ 
centrates attention on the main point at issue. That is my reason for emphasizing 
it in these Notes. Most of us have many preconceived ideas about appropriate 
tests if the probability law is taken in the form of p{x | H 0 ); we are accustomed to 
use the mean, the standard deviation, certain functions of moments, the y a test,.... 
But we are not so accustomed to test whether a sample comes from a rectangular 
distribution and we are therefore forced or, indeed, more willing to reconsider 
from first principles what course we should follow and why. 
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In dealing with the problem of testing statistical hypotheses J. Neyman (1937) 
and E. S. Pearson (1938) have considered the use of the probability integral 
transformation, which leads to a theoretical uniform distribution. This method 
presupposes that the usual comparison between theory and observations has 
already been applied. We shall first improve this comparison by introducing 
control curves. Then we shall apply to the uniform distribution the usual methods 
and the control curves. This will lead to simple tests for given statistical hypo¬ 
theses. 


1. Control curves and the probability integral transformation 

Let a; be a continuous random variable for which n observations have been 
made. Let x m be the observed values arranged in increasing order of magnitude, 
m (1,2,...,«) being the serial number, The simplest way of representing the 
observations is to plot the cumulative histogram x m , m. The relative number 
1F (0) ( x m ) of observations less than or equal to x m is given by 

m = nW i0 \x m ). 

The consecutive differences 

n W (0) (a;, n ) — n W (0) (x,) (I < m) 

constitute the observed distribution. Many statisticians present, instead of the 
original observations x m , only the number of cases within certain arbitrary classes. 
From the practical standpoint this means a simplification, from the theoretical 
standpoint a complication. We shall suppose that all x m are known. 

The choice of a probability density to be applied to the observations con¬ 
stitutes the hypothesis. The probability density w (x.cq, c 2 , ...), where c v c a , ... 
are the constants, is called the theoretical distribution. For sake of simplicity it 
is assumed that all observed values x m have the same theoretical distribution. 
The probability W ( x , c t , c 3 ,...) of a value equal to or less than x is given by 

W(x,c 1 ,c 2 ,...) — j* w{z,c 1 ,c i , ...)dz, H) 


21-2 
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where 2 is the variable of integration. It is customary to compare the observations 
x m , m with the cumulative frequency curve x, nW(x). This comparison can be 
improved, in the following way; The mth observation is a statistical variable 
distributed according to 





mW{x) m ~ 1 {l — W{x)) n ~ m w(x). 


( 2 ) 


In a previous article (Gumbel, 1935) it has been shown that, for an ordinary 
unlimited distribution, with large n and with m of the.size in, the distribution. 
(2) converges towards a normal distribution with a mean given by W{x) = m/n 
and a standard deviation 


1 j W(x)(l-W(x)) 

w(x) ff n 


(3) 


This formula does not contain m explicitly. Since each theoretical value x can 
be interpreted as an mth value, (3) gives its standard deviation. The interval 
x + cr will be called the control interval. Under the above condition, the probability 
that an mth value will fall within the control interval is about f. The two curves 
obtained by plotting x + <r, nW{x) will be called control curves. 

For a given initial distribution we shall have to find the mean and the standard 
deviation of the mth value which may differ from those of the general solution, 
especially if n is small. The control interval will be a certain function of w(x). 
Also the probability associated with the interval x + a may differ from that of the 
general solution and may depend upon x. For the exponential distribution 
(Gumbel, 1937) the precision diminishes with increasing values of the variable. 
Below we shall apply this control to the uniform distribution w{x) — constant. 

The calculation of the probability ffl{x) and the control curves can often be 
, simplified by an indirect method. For certain, but not all, distributions it is 
possible to eliminate the constants by introducing a new variable y as a function 
of x, where 


® =/(^c 1 ,c 2 ,...). 


(4) 


Accordingly, the probability that a value of the transformed variable will fall 
in the interval y to y -|- dy is 

w(x) dx = w[f(y)] | f{y) | dy. 

We call p(y) = w[f(y)] | f(y) |, 


or 


p(y) = w{x) 


dx 

dy 


(6) 


the distribution of y. The probability V(y) of a value equal to or less than y 3s 


V{y) = W{x,c v c »...)., (6) 

The transformation (4) is chosen such that the expression V(y) does not contain 
any constants which depend upon the observations. Therefore V(y) can be 
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calculated once and for all as a function of y. Such tables have been calculated 
for the distributions for which this reduction is possible. 

In order to compare the cumulative histogram x m , m with the cumulative 
frequency x, nW{x), and to use the control curves, it is first necessary to compute 
the constants c x , c 2 , .... If the method of moments is used, the area between the 
cumulative histogram and the horizontal line W = 1, the arithmetic mean, is 
conserved. The value of the variable x which corresponds to a selected, numerical 
value of W(x) is obtained from the transformation (4). 

A special case of the transformation (4) which leads to a reduced distribution 
of astounding simplicity is the probability integral transformation due to Karl 
Pearson (1933) who introduced for y the probability function lf(.r). Since 

dx 1 

dW~w^)’ (7 > 

we obtain from (6) p(W)sl. (g) 


This identity means that the distribution of the probability is constant. As 
p(W) is the probability density of a probability, it is difficult to establish its 
philosophical meaning. But formally the construction is valid and the corre¬ 
sponding value can be observed. It is our purpose to give .several methods of 
judging the significance of the differences between the theoretical distribution (8) 
and the corresponding ‘observed’ distribution. 

The word ‘observation’ will be given a special meaning. A certain theory h a 
which involves the choice of certain constants c x , c 2 , ... applied to the observa¬ 
tions, leads to the values 


TT 0 = W{x m> Ci,c 2 ,... | A 0 ) (w= 1,2 ,...,to), 

corresponding to x m . These values, contained in the interval 0, 1, are the ‘ obser¬ 
vations ’. To any other set of constants c{, cj>, ... will correspond other ‘ observed ’ 
values 


W' 0 =W(x m) c[,c 2 ,. 


u 


Therefore any test applied to the ‘ observations ’ of formula (8) might be used to 
judge the choice of the constants. To another hypothesisA x containing the con¬ 
stants d v d 2 , ..., will correspond another set of ‘observed’ points 

= Wi x m> ^1> • • • I ^l)* 

The same observations when interpreted by different theories or different 
constants lead to different ‘ observations ’. An incorrect theory involving properly 
chosen constants might give better results than a correct theory involving im¬ 
properly chosen constants. Therefore, to compare different theories, the constants 
for each must be determined with the same precision. In practice this condition 
will not always be fulfilled. For the precision of the characteristics depends upon 
the distribution, for which they are calculated (Gumbel, 1936). Therefore, the 
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same method of determining the constants might lead to different degrees of 
precision for different distributions, whereas different determinations of the 
constants might lead to approximately the same precision. 

There is another point for caution: all tests derived from the probability 
integral transformation apply to the analytic form of the hypotheses and at the 
same time to the choice of the constants. A formula containing many constants 
may reproduce the observations more closely than a formula containing few 
constants, even though the constants in the first hypothesis have no meaning. 
Therefore we must limit our comparison to hypotheses containing the same 
number of constants. .For any set of statistical observations in the ordinary sense, 
there will usually correspond a small number of tenable hypotheses. We shall 
suppose it is known what they are. For we do not try to find a formula for the 
sake of doing it, but to explain the observed facts. We will not go so far as Neyman 
(1937), who formulated all possible alternatives by a series of orthogonal 
functions. 

In theory the points representing W(x m ) are distributed uniformly in the 
interval 0,1. This is true for any hypothesis, provided the variable is continuous. 
But in practice this will never occur. The ‘observed’ points corresponding to any 
given hypothesis will differ from the theoretical set, even if the hypothesis is a 
very good one. The differences between the ‘observed’ set of points resulting 
from h u and h x and the theoretical set allow the construction of tests which 
can be used to judge which of two given hypotheses is the better. But no 
statistical method gives an answer to the question whether or not a hypothesis 
is true. 

After a hypothesis has been selected, the preliminary steps which have to be 
made before it is possible to use the probability integral transformation, are: 
first, the determination of the constants; secondly, the calculation of probabilities 
W(x ) for the values of x given by the transformation (4); and thirdly, the calcula¬ 
tion of the probabilities W(x m ) of the observed values. It is only after these three 
operations have been carried out that we obtain the ‘observations’ which are 
to he compared with the theory (8). Therefore, any test based on the probability 
integral transformation presupposes the usual comparison of the observed 
cumulative histogram with the frequency ourve. In many cases this comparison, 
checked by the control curves, will indicate a clear superiority of one of the 
theories. If this is true, there is no necessity for a new test. 

It would be interesting to investigate the best criterion for judging the 
significance of the differences between the ‘observed’ and the uniform distribu¬ 
tion (8). But for practical purposes it is sufficient to know whether the differences 
for h 0 are smaller or larger than for h v First we shall establish rough measures of 
comparison; afterwards, more refined ones. 
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2. Classical tests applied to a uniform distribution 

The comparison, of an observed distribution of a continuous variable with the 
theoretical distribution is reduced by the probability integral transformation to 
a comparison of ‘observed’ points with a uniform distribution. It seems logical 
to use first the classical methods which are here very simple, as no constants have 
to be determined. For a uniform distribution 


p(y)=l (0£y£l), 

the arithmetic mean and the median are, respectively, 

y = y = h 


(9) 


The mean error 6 and the probable error p, defined as half of the difference between 
the two quartiles, are 

Q - P = i (10) 

The Jfcth moment about the origin is 

M k = 


which gives the recurrence relation 


Mu 


k + l’ 


Mu~ 1 ' 


( 11 ) 


Since the distribution is symmetrical, the odd moments about the arithmetic 

mean vanish. Therefore D n 

Pi = 0. 

The even moments are 


Pafc = f [y-\Y kd y = 2 fVdz or = 
Jo Jo 


1 


2 2k (2k +1) ’ (12) 

Therefore the standard deviation, the coefficient of variation and the second 
beta are, respectively, 


Cr ~ 2 ^ 3 ’ 

(13) 


(14) 

P* = I • 

(15) 


and 

It is necessary now to calculate the ‘observed’ means, the measures of dis¬ 
persion, and the relations between successive moments. To control the agree¬ 
ment between the theoretical uniform distribution and the ‘observed’ points 
we can still employ the standard error cr^ of the arithmetic mean. The general 
formula _ 
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becomes according to (13) cty = . 

The standard error of the dispersion for n large is cr(a 2 ) 
becomes, according to (12), 

^ = y(i%(5~9)} = 6V(5 »)' 


(16) 

which 

(17) 


It seems reasonable to employ these old-fashioned tests before the use of 
more sophisticated methods is resorted to. Only if they fail would it be necessary 
to consider more elaborate methods. 

The n ‘observed’ points }F(a; ? J represent the probabilities, obtained from the 
hypothesis h Q , of the given observations in the ordinary sense, x m . We plot these 
points in the interval 0, 1 which is divided into k cells of equal length, where k 
is chosen in such a way that njk is an integer. If n is a multiple of 10, we choose 
the cells (0-0,0-1), (0-1,0-2), ..., (0-9,1*0). 

The probability density of a point falling somewhere within the interval 0, 1 
is constant. As the interval is of length 1, this density is 1. Therefore, the prob¬ 
ability of a point lying with a given cell is 1 jk, and the expected number of 
points in each cell is njk. The ‘observed’ number of points obtained through a 
hypothesis h 0 will be a v (v = 1, 2,..., k). If we apply another hypothesis h x to the 
same observations or introduce other numerical values for the constants, the new 
set of' observed ’ points will lead to values b v which, in general, will differ from a v ■ 

The classical statistical method of treating this material is the y z tost. As the 
numbers a v , b v will differ from the expected number njk, we can calculate for both 
hypotheses , k . , 2 ; k . 

• a-uM- < 18 > 


The better hypothesis will have a lower value of y a and a greater value P, where 
P denotes the probability of obtaining the ‘observed’ deviations from uniform 
distribution or larger ones. The probability P depends upon the number of cells 
chosen. Therefore, to compare two competing hypotheses, the same division must 
be used. 

The application of the y 2 test to the ‘observations’ W(x m ) eliminates an 
arbitrary action which is a serious and well-known drawback of the y 2 test, when 
applied to the original observations x m . The expected contents of the classes 
depend upon the distribution. Therefore certain classes, as a rule the first and the 
last, must be chosen such that the expected number is not too small, otherwise 
y 2 becomes very large. In our case, no cell differs from any other and no arbitrary 
combination of cells is needed. We can choose k = n. The mean number of points 
in each cell will then be one and 



n 


SK-i) 2 , 

>>=i 


= £ (&„—!)*• 


V=1 


(18') 
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This choice removes another drawback of the x 2 method: different classifications 
■used for the same observations lead to different shapes of the distribution and 
therefore to different values of x 2 . Here the classification is prescribed once and 
for all. 

Another comparison between theory and ‘observation’ may be based on the 
fact that different sets of points a v and b v have different probabilities. The prob¬ 
ability that a v points will fall within the cell v (= 1,2,.. k) is 

n ! /\\a 1 +a 1 -r...+iij. 

a x \a 2 \ ...o^! \hj ’ 

where a 1 +a 5 ,+...+ a fc = n. 

Since the factor n ! k~ n is constant, it is sufficient to investigate 

n -TT—• (19) 

nv 

V=>1 

Of course 17 < P, as the latter probability applies to the ‘observed’ deviation or 
larger ones. The statement ‘The probability for a v points to be contained in the 
cell v is proportional to 17’ may be inverted according to Bayes’s principle. 
Therefore, 17 is proportional to the probability that the distribution of point© 
is rectangular, i.e. that h 0 is a good hypothesis. 

The question for which set a„.the probability 17 is maximum, is the starting- 
point of the classical relation between entropy and probability. For large n the 
most probable set of points is the one which has the same number of points in 
each cell, i.e. 

= ... = ( 20 ) 


Let us call 17 max . the probability which corresponds to this distribution. The 
probability of the hypothesis h 0 will be greater, equal to, or less than the prob¬ 


ability of h v if 


H° > 17 x 
H m ax. H raax , 


( 21 ) 


The relative probability of both hypotheses will be IIJIIq or n o /n it depending 
on whether _ ^ „ 


As these probabilities depend on the number of cells, the same division must be 
used to compare two competing hypotheses. We can choose k —n. Then 17 max , — 1 
and we can use 17 0 as test. 

The entropy test (21) is closely related to the test (18). This classical 
relation can be obtained in the following simple way: if q is the constant prob¬ 
ability of a point falling within a given cell, then for n observations the expected 
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number of points within a cell is nq. But the ‘observed’ numbers a v will differ 
from the expected number by e„ so that 


a, = nq+e v , 

fc 

where £ e » — 0. 

The quotient (21) becomes 

n« _ X (ng)l 
-^max. »=i (®? 4 ®n) • 

When n is large, each factor becomes, by application of Stirling’s formula. 


nq\ 


(Mg + e, 

Expansion of the logarithm leads to 
nq 


yr (4)*' «p[- ( »9 + o.+i ) b( 1 +^)_ • 


(nq + 


e v )\ [nq) GXP [ e ” 2 nq 2 


According to the meaning of e„ we obtain 

n 0 


n„ 




whence 


n, 


17, 


®-= e-ix 8 . 


max. 


( 22 ) 


Therefore, when n is large, the entropy test becomes identical with the % 2 test. 
This result was derived by Neyman & Pearson (1928), when they showed that 
the x 8 test followed from their ‘likelihood ratio ’ method of approach. 

Neither test will give an answer, if the number of points a v assigned by h 0 to 
the cell v is equal to the number of points b x assigned by \ to the cell A, a y = t A , 
where for any v (= 1,2, ...,k) it is possible to find a A ( = 1,2, ...,&), such that not 
all A = v. An example of this occurring is shown in Table 1, col. C and E. The 
reason for the failure is that we do not make use of the actual position of the 
observed points within the cells. We only ask in which cell they are situated. 
Although in such a case the tests do not show any difference between the arrange¬ 
ments a„ and b x , some conclusions might be drawn from such ‘observations’. 
If the number of points falling in the first few cells and also in the last few cells 
is disproportionately large, and if there is a deficiency in the middle cells (Table 1, 
col. D), we have to conclude that the distribution h 0 is too concentrated or that 
we have chosen too small a value for the constant which depends only on the 
standard deviation. If the number of points in the cells at either end is small 
(Table 1, col. E), the inverse inferences follow. These considerations may give a 
hint about the choice of an alternative hypothesis. , 
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To illustrate the above methods, let us take the fictitious example given by- 
Pearson (1938) in his Fig. 2, p. 136. He arranges n = 10 points in k = 10 cells 
and considers six sets A, B, ..., F, given in Table 1. 

Let us suppose that these six sets are the results of six different hypotheses 
applied to the same observations. The y 2 test leads to 

Pa - > Pd > Pc = Pp > Pb = Pe- 

The probabilities of the various columns give the same ordering 

P, max. = P- a ^ Pd - > Pc ~ Pp > Pb — Pp- 

The most probable set contains one point in each cell (set A). It is not possible 
to decide whether C is more probable than F, and whether B is more probable 
than E. 


Table 1. Pearson's set 


Class 

A 

B 

C 

D 

E 

F 

0 - 0 - 0-1 

1 

2 

0 

2 

0 

0 

0 - 1 - 0-2 

1 

3 

0 

1 

0 

l 

0 - 2 - 0-3 

1 

2 

0 

2 

1 

2 

0 - 3 - 0-4 

1 

1 

0 

0 

2 

2 

0 - 4 - 0-5 

1 

0 

1 

0 

2 

0 

O - S - O-6 

1 

1 

2 

1 

3 

1 

0 - 6 - 0-7 

'1 

0 

1 

0 

1 

0 

0 - 7 - 0-8 

1 

1 

2 

1 

1 

0 

0 - 8 - 0-9 

1 

0 

2 

1 

0 

2 

0 - 9 - 1-0 

1 

0 

2 

2 

0 

2 

X 3 

0 

10 

8 

6 

10 

8 

P 

1 

0-350 

0-534 

0-740 

0-350 

0-534 

n 

1 

•h 

A 

i 

1 

2 4 

rW 


The y 2 an d the entropy test are based upon the same data. But the results 
reached are incomplete, as artificialities are introduced by the classification of 
the ‘observations’ into the arbitrary cells. The actual position of the points 
within the cells is not used. The set A shows that these tests may be misleading 
in still another way. Each cell in set A contains exactly the expected number. 
But it would be false to conclude that the hypothesis is true, since the actual 
positions of the points within the cells might differ from the ideal positions. 

Let us suppose we know these positions. It might then happen that the 
difference between the observed and the ideal positions of the points is smaller 
for a set K than for a set L, even if the differences between the actual and the 
theoretical number of points is larger for K than for L. 

It is now our task to assign a meaning to the term ideal position and to define 
a measure of the differences between the ‘ observed ’ and the ideal set. 
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3. The with point test 


The ideal position of n points, distributed with uniform probability over the 
interval 0, 1, is such that the distances between consecutive pairs of points are 
equal. But there are a number of ways of distributing to points equidistantly 
over the interval 0,1. E. S. Pearson, in the preceding note, suggests that 


Vvi 


2m — 1 
2 n 


(23) 


might be used as the ideal position of the mth point. However, as y is a statistical 
variable, we should represent it by an average, to choose which we must consider 
the distribution of the mth point.* Any observation chosen at random has the 
same probability of falling on any position y within the interval. But for the mth 
point this probability depends upon y and m. The initial distribution w of the 
variable W{%) = y is constant. According to (8) 

w(y) = 1 (0<ygl). 

The probability of obtaining a point equal to or less than y is y. According to 
(2) the distribution of the mth point is 


^Jy,n) = J my m ~ l { 1 - y) n - m . (24) 

The distribution (24) is of Karl Pearson’s Type I. For m — 1 (and m — n) the 
distribution will only decrease (increase). The distribution of the mth point is 
equal to the distribution of the (to - m + l)th point. If we replace y by 1-y 

hV-m+iO -V,n) = ID m (y, n). (24') 

The most probable position y of the mth point is given by 

re —m _ m— 1 

i-y ” V ’ 


which leads to 


y« — 


m— 1 

TO— 1 ‘ 


(25) 


For given values of m the median position can be obtained from the tables of 
the Incomplete Gamma Function, To find the arithmetic mean y and the control 
curves, it is necessary to have the moments M k of (24). They are 

Mk = t) m J 0 2/ra+fc ~ 1(i_2/ ) ,, ~ ro ^' 

According to the well-known properties of the Gamma function 

^ __rel_ (m + fe- l)i (TO-m)i _ to! (m + k— 1)! 

k (m-1)! (TO~m)! («, + &)! (n + k )! (m— 1)! 

* [The distribution of a ranked individual sampled from a rectangular population, and the 
moments of this distribution, were obtained by Karl Pearson in the first of two papers (1931, 
p. 390, and 1932) dealing with ranked variates. Ed.] 
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(26) 


Vm = 


wi 


» + !’ 


and for k — 2 the second moment is 




_ m +1 


Finally, the variance 


or. 


2 _ 


n + 2' 
m(n—m+ 1) 


(27) 


(28) 


(n+ 1) 2 (ti + 2)' 

These formulae apply also to the cases m = 1 and m-n. The standard deviation 
of the with value may he written 

Vm) 


-j* 


71 + 2 


(28') 


This formula differs slightly from the general expression (3), and leads to an 
unexpected result: as we approach the centre from either side, the precision of 
the with point decreases. The precision of the with point will be a minimum for 
wi » |ti+ 1, if 71 is even, and for m = |(n+1) if n is odd. The values of <r m *J(n + 2) 
are given in Table 2. 


Table 2. Standard deviation of the mth point 


im 

Vm 

<r m f(n+2) 

0'05 

0-95 

' 0-21794 

0-10 

0-90 

0-30000 

O'15 

0-85 

0-35707 

0-20 

0-80 

0-40000 

0-25 

0-75 

0-43301 

0-30 

0-70 

0-45826 

0'35 

0-05 

0-47697 

0-40 

0-60 

0-49000 

0'45 

0'55 

0-49750 

0-50 

0-50 

0-50000 


We must now decide whether to use the mean (27) or the mode (26) as the 
ideal position of the mth point. The modes of the first and of the last points are 
0 and 1 respectively, whereas the corresponding means are 

= 9 *" 1_ n+l* 

As the ‘observations’ Wo^) and W Q (x n ) of the first and the last point differ 
from 0 and 1, the arithmetic mean is to be preferred. 
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Formula (27) gives the ideal position and therefore the theoretical numbers 
of points in each cell which can he compared with the ‘observed’ numbers. This 
method leads to an improvement of the tests (18) and (21) where the choice of 
the cells was still arbitrary and where the actual position of each point was not 
taken into account. 

Besides comparing the uniform distribution with the ‘observed’ position of 
the points we can use the corresponding cumulative frequency. The probability 
scale y is plotted as abscissa and m as ordinate. We count the number of points 
below m. The mean position y of the with point becomes a straight line differing 
from the diagonal which represents the modal positions. The figure opposite 
traces, for n — 20, the mean, the modal and Pearson’s position of the mth point 
given by (23). 

In the same way we plot the ‘observed’ points obtained by h 0 , h v .... These 
probability points W Q (x m ), Wfx m ) will be scattered about the straight line. Usually 
the area between the observed cumulative frequency curve, the ordinate and the 
parallel to the abscissa is kept equal to the corresponding area for the theoretical 
curve. Since for the present problem no constants have to he determined, we 
have no way of enforcing this equality. The area J bounded by the diagonal 
Straight line through the points with the co-ordinates mj(n+ 1), m (m = 1, 2,. 
the length lj(n+ 1) to n/{n + l) of the abscissa axis and the two parallels to the 
ordinate axis, is J _«»_!). ( 29 ) 

This might differ from the area J® of the n— 1 ‘observed’ trapezes 
tj m ,rn,m+l,y m+1 (m = 1 , 2 , 1 ). 

As y m are the ‘ observed ’ points 


n~l 

■■ 2 (w+i) {y m + x -y m ) 

771=1 


Xy m (rn-l) 

i 


n —1 j n 

- 2 my m + ~^y n 

l 2 


1 n ~ 1 
■ 2 2 Vm 


= ny n -zy m +Uy n ~yi)- 

i 


If we replace each value y m by its expectation from (27), we get 

= !(«-!), (29') 

as it ought to be. The ‘observed’ area is not equal to the theoretical area, but its 
expectation is. It might happen that the numerical value of J (0) is very close to J 
as a result of compensating deviations. Therefore this numerical comparison can 
be used as a test only in connexion with the graph of the * observed ’ and theoretical 
cumulative histogram of the mth points. 
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To control the agreement between the ‘observed’ cumulative histogram and 
the ideal straight line we use two control curves through the points yTcr, m, 
where y is given by (27) and a by (28'). They are traced in the figure for n = 20. 



Probability y ~ W(2„) 

Fig. 1. Control curve for uniform distribution. 


-- Mean mth point y (27) —- Modal mth point p (25) 

-Pearson’s mth point $ (23) |—(—|—j Control curves 


It is interesting to calculate the area A bounded by these two curves and to 
compare it with the area J. If we consider m as abscissa and y + cr and y-cr as 
ordinates we have, for n sufficiently large, 


A =J [y+a--(y-o-)]dm 



n 

<r dm. 


i 


If we introduce y as variable of integration we obtain from (28') 


2(71+1) C nKn+1) IS *, 


*J( n + 2) J i/(n+l) 
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The transformation 

y — sin 2 1, 1 - y = cos t, dy -- 2 sin t cos t dt, 
leads, as is well known, to 

= i[<-sin£eo8«Jgj. 

71 3 . 

The limits are given by 

“'•-.Mi' 

For the expansion of t 0 it is sufficient to put 

, .11 1 _ I 

0 arCi ijn+l f(n+l) 6(n + l)*J(n+l)’ 

provided (w+1) 2 > 1. Under the same condition, 
arc sin f(l — £ 2 ) = arc cos x 

— Jrr — arc sin a:, 

becomes for any | x | < 1 

arc sin ^(1 —as 2 ) = In — x + lx 3 . 


Therefore 


so that 


t --- 
h ~ o 


1 


1 


2 J(n + iy6<](n + l)(n+l) 

tj-tp _ 77 1 1 


2 4 f{n^- 1) 6{n +1) f(n + l) 

The second factor in the brackets becomes 

2smi 0 ^(l-sin 2 £ 0 ){cos 2 i 0 ~sin 2 t 0 ) — 2aint 0 f(l ~sin 2 J 0 ) (1 — 2sin 2 t 0 ) 

— %^ n ( n ~ jj 

(n+l ) 2 

In the same way 

***-*>*> - - ~ 2 i£r- 

Finally, we reach the area bounded by the control curves 

. n + 1 /n (n-l)fn 1 1 

~f(n + 2)\i + (n+1) 2 ~ffa + T) + 6(n+l)f(n + l)) 

According to (29) the ratio of the area bounded by the control curves to the area 
of the cumulative histogram 


))• 


A w+1 


J n-Q{n+2)\i + («.+ l) 2 ^(n+l^Gin + ljfin+l) 
converges towards zero as 1 :fn.- 


(n-l)*Jn _1 




(30) 
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The propart™ (28', and (30) of the emulative histogram „f the position, „f 
the nth pomts allow for the comparison of the 'observations' IK(r ) a „ d the „! 

ff* T + ‘ y d t ; 1 1 oft T be “ to - **£££££ 

observatron, and theory to judge which set of 'observed' points is doser! 
on the whole, to the theoretical positions. It seems legitimate to prefer a hypo- 
thesis h 0 if the control area contains more points for h 0 than for A 

In order to secure a numerical test, we can introduce the mean ®» of the sum 
of the squaies of the differences between the ‘observed’ values y = W(r ) anrl 
the mean positions y m = m[(n +1). Take " “ 


b n 


l m \ 2 
V m n+ 1)> 


(31) 

where the value of the constant k will be specified later. One extreme for (31) 
would be to have the theory hold for every point. Then the value of the sum 
would be zero. The other extremes would be when all points are concentrated 
either at the origin, zero, or at the end, 1. In the first case 

1 


£ o 2n +1 

r 2 £ m = 


n(n + l) a „iTi " ~ 6(w + 1)‘ 

The second ease leads to' the same value, since Em = £ n (n + 1) and therefore 

u 1 


1 S (l— 

n + ij 


1 + 


n(n+ l) 2 


n 

s m 2 . 

vi -1 


Therefore 




2 w+1 k 

'6(71+ l)^ 1 


(32) 


In order to draw conclusions from an observed value $ 2 we have to calculate its 
expectation <3 2 . We will determine k in such a way that & is independent of n. 

A test similar to (31), but serving another purpose, has been introduced for 
the usual distributions by H. Cramer (1928) and R. von Mises (1931). When 
applied to uniform distributions, this w 2 test leads to the use of \j m of (23) instead 
of the mean value y m . For this test the sum of the deviations is zero which does 
not hold in our case. The expectation <3 2 of 0 2 is 

2 my m 


@2 


- “(if*- 




m* 


r-])' 


n+1 (n+l) 2 

The first two sums are obtained from (27) and (26). Therefore 


<S 2 == 


I 2m(m + i) 

» \ 

\(w+l)(?i+2) 

(»+1)7 


t* 


m 


n 

Sm 2 

i 


7 i(ri -4* 1) \U 4* 2 {yi * 4 - 1) (71 -f- 2)/ 
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The introduction of the sum of the powers of the natural numbers leads to 


6 2 = 


/ 2»+l\ 

)V 3n + 3/ 


2(w + 2)\ 3W + 3/ 6(w+l)‘ 

Taking k = 6(n +1) we propose therefore, as test of a hypothesis h 0l the coefficient 


@2 = 


6(n+1) 


n 


) * ( mV 


(33) 


which, according to (32), can assume values between zero and 2w+1, and has for 


expectation the value 


S 2 = 1 . 


(34) 


Of two competing hypotheses, the one with the smaller value of S 2 is to be 
preferred, 

The S 2 criterion does not introduce any arbitrary classification. It makes 
use of all observations. Besides the probabilities W(x m ) corresponding to the 
observed values x m no new calculations are needed. The test has a clear meaning 
and its application is simple. This is due to the fact that it is a natural consequence 
of the probability integral transformation. 


Summary 

We propose the following procedure for testing statistical hypotheses: The 
constants for competing hypotheses, having the same number of constants, are 
determined in such a way that their precisions are approximately the same. Then 
we calculate the probabilities lh 0 (a: m ), W^(* m ), ..., and their respective control 
curves. We trace JF(a:J, x m + cr rn and compare it with the observed frequency 
curve. If neither the classical tests nor the control curves indicate a clear superi¬ 
ority of one of the hypotheses we consider the probabilities as ‘observations’ 
and plot the corresponding points on the y axis. We now compare, by formulae 
(9)—(17), the ‘observations’ with the theoretical uniform distribution and apply 
the x 2, an( i entropy test of formulae (18) and (21), respectively. If necessary, we 
repeat these tests in such a way that the actual position of each point is taken into 
account. Formula (18') gives a value of x 2 which is independent of the classi¬ 
fication, Then we plot the cumulative frequency of the ‘observations’, which is 
compared with the straight line (27) and controlled by the values given in Table 2. 
The final test is given by (33). 
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THE RANGE IN RANDOM SAMPLES 
By H. 0. HARTLEY 
1. Introduction 

If the observations x v (v = 1, 2, ...,n) of a random sample are arranged in 
ascending order of magnitude (a:„.|. 1 > x v ) the range w in such samples is defined as 
the distance between the two extreme observations 

w = x n — x v 

It may therefore be regarded as a measure of the variability or dispersion among 
the observations of the sample. Theoretically its efficiency in the sense defined by 
R. A. Fisher is, as a rule, much inferior to that of the standard deviation. More¬ 
over, extensive investigations have shown that its random sampling distribution 
is markedly dependent on the parental population (E. S. Pearson, 1926). For 
large samples x p drawn from a parental distribution f(x) the extreme values 
and x n will lie right inside the lower and upper tail of f(x), and in practice it is 
only in exceptional cases that the exact shape of f{x) has been established to such 
a degree of accuracy that the resulting distribution of w can be trusted for large n. 
In most cases the use of the range must therefore be limited to small samples, 
say with 2 < n < 20. 

Large numbers of small samples may often be used with advantage when the 
mean range is calculated as an estimate of the standard deviation of the popula¬ 
tion (Pearson & Haines, 1936). Although theoretically such an estimate is not 
efficient and certainly not sufficient, it is nevertheless of considerable importance 
in many fields of application because of its simplicity. Statistical control charts 
in industrial quality control make extensive use of it, and more recently the 
range has been applied to investigations in gunnery. 

In some fields of application a disadvantage may arise from the fact that the 
range is an inexact statistic; its random sampling distribution depends on the 
standard deviation of the parent. This applies in particular to the analysis of 
small samples in biological experiments. The tendency of modern small sample 
theory has been to replace such statistics by what are called exact statistics, 
obtained by substituting for the unknown standard deviation of the parent an 
estimate calculated from an independent sample. This particular process of 
reaching exact statistics has sometimes been referred to as ‘Studentization’. 
A general theory of this process will be given in a further paper which it is hoped 
to publish in this journal, where it will be shown how estimates of scale parameters 
in general, and of the range in particular, may be converted into exact statistics. 
In this paper, however, we deal with the case where the standard deviation of the 
parent is known. Indeed, it is this dependence of the random sampling distribu¬ 
tion of range on the scale parameter of the parent which makes it possible to 
estimate its efficiency as an estimate theoreof. 
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The question of grouping has been a subject of investigation in the ease of 
the sample standard deviation; we shall here deal with the effect of grouping on 
the range, a problem which has so far received, we believe, no attention what¬ 
soever.* As practical examples of the occurrence of grouping we may quote 
three instances: 

(а) The rounding off of data for convenience of recording and analysis. 

(б) The recording of data to the nearest unit of measurement. Where the 
technique is of low accuracy (see e.g. Tildesley, 1940) the unit of measurement 
will be comparable in magnitude with the standard deviation of the actual data. 

(c) The analysis of data which are classified in categories, In such cases we 
may often find that the original data are unobtainable so that group frequencies 
are the only material available for an analysis. 

It will also be shown how the random sampling distribution of the range in 
grouped samples provides a suitable approach to that of the true range (ungrouped 
range in sample) on which extensive work has been done in earlier papers published 
in Biometrika. The mathematical formulae developed in this paper make this 
complex distribution amenable to a tabulation. For the case of normal samples, 
the work has actually been carried out and the resulting tables of the probability 
integral are given and discussed elsewhere in the present issue of this journal, 

2. The distribution oe the range in a grouped sample 

Let us denote by aq, x n the observations in a random sample drawn from 
the parental distribution f(x) f and arranged in ascending order of magnitude. 
This sample is now classified in groups or categories of constant length h with 
equidistant end-points 

..., g-kh, .... g-h, g, g+h, ..., g + kh, ..., (1) 

covering the whole x scale from — go to + co, Let us denote by g n and g x the 
respective centres of the categories containing x n and aq. Then the problem is to 
find the random sampling distribution of the range in a grouped sample, i.e. of 
£ n — g v The mean of this distribution is of particular interest. Obviously this 
statistic can only assume values which are multiples of the group interval h and 
is therefore discontinuous. Like the distribution of the ‘ungrouped 1 or true range 
it depends on the standard deviation cr of the parent f(x). In addition, it depends 

* The elfect of grouping seems to be of some importance in researches on the technique of 
anthropological measurement (Tildesley, 1940), where some of the results given below have already 
been applied before this paper had gone to pross. 

t We shall deal here with a parental population represented by a ‘piecewise continuous distri¬ 
bution function f(x). A function is called ‘piecewise continuous’ for ~coct<+co if in any 
closed interval of x the function f[x) is continuous apart from a finite number of ordinary dis¬ 
continuities. If the actual range of the variate is bounded we simply define /(*)=0 outside this 
range. Moreover, we assume that j{x) has contact of at least second order at +co. It is easy to see 
how our results may be generalized to cover distribution functions with singularities. 
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i+m\n 


on the category width h and on the position of the category midpoints relative to 
the population mean X. Of these parameters only h will in practice be known. 
Methods to eliminate cr are to be given in a separate paper whilst the elimination 
of X is dealt with in the section on randomized grouping (6). 

It will be convenient to use the following notation: 

rj rt+jh . fi Cg-t-ift /*« (*« 

= f(x)dx, = f(x) dx, = f(x) dx. 

Ji J £+1h J — oo J —a> Ji J Z+ih 

Let us now find tlie chance that £, n — 

is at most (m — 1) h, and that in addition 

for a particular value of i. This chance is given by 

) n [ l'i+m\n 

-(J J • (2 > 

The first term in (2) represents the probability for all x v to lie between £ + ih and 
£, + (i+m) h. Prom this we have to deduct the chance for all x„ to lie between 
£ + (i + 1) h and £ + (i + to) h which is given by the second term of (2). In taking 
the difference we are therefore left with the chance for the occurrence of a sample 
completely contained in the interval £+ih to £+ (i + m) h but with at least one 
of the x { lying between E, + ih and £ + (i + l)h. This proves that (2) represents 
the required chance. Now, since all samples may be classified with regard to 
their lowest category, the probability fox £ u --£i to be at most (w— 1) h is given 
by summation over all i of the expression (2). If we denote this probability by 
P{n,h,m— 1, £) we find 

+ <o I/ l'i+m\n j ri+m\n\ 

)-(JJ). (») 

With equation (3) we have reached a formal representation of the random 
sampling distribution of £ n —£ v Its evaluation is a simple matter for large group 
intervals h and for parental distributions/(a;) with a tabulated probability integral 

f(x) dx. If we were to take the trouble of tabulating the probability integral 


/■ 


(3) we should obtain the mean of £ n — as a by-product from a summation of (3). 
It will be shown in the next section that this summation, if carried out analytic¬ 
ally, produces a very simple formula for this mean. 

3. The mean range in a grouped sample 

To find the mean of the distribution it is convenient to extend the summation 
in (3) from some finite negative value i = — j up to + oo. By choosing j sufficiently 
large the resulting error may be made negligible. We introduce 


co ~/Ci+m\n I rt+m\n~ 

f=-jLVi / \Ji+\ / J 


( 4 ) 
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and find for the difference between P(m- 1) and p^(m- 1)* 

-j ri +1 r-i+i 

| P(m-l)-p_,(m-l)|< 2 » =» , (5) 

i= — oo J i J -co 

for all m and j. To find the mean of £„ - E, x we must first note the probabihty for 
this statistic to be exactly equal to mh, where m = 0,1, 2,.... Denoting this 
probability by <j>(m) we have from the definition of P(m) 

= P(m)~ P(m- 1). 

If we denote the mean of g n — £, x by 3 we have by definition 


3 = h 2 <f>{k)k 
k =o 


= h lim {(m + 1) P(m) - S m }, 

m-> co 

S m = (m + l)^(0) + m^(l) + ...+^(m) 


( 6 ) 


m-> oo 

where 

or = P(0) + P(1) +...+ P(wi). (7) 

To find 3 let us first consider the second term in formula (6). We have from 
equations (7) and (5) 


S m = p_,(0) + ... + 2 >_,(m) + e lJ 
r-i+i 

where I e i I ^ + 1)J . 

for all j and m. Now from the definition of p_ } (k) we find 


m to I oo r~/ p<+ft+l\n / /•i+fe+l\n - 

)-(L )] 

m (I r-j+k+l\n co r/ j'i+k+2\n / ri+k+l\n~] 

=,?.!(L ) + ,?I(L ML )J 

to / ('-i-t-fc+l\n. to / t‘i+m+ 2\n 

■» 5 ,(j., K S iL )• 


Putting now mi = 2), we have 

2j 1+1 / ri \n 1+1 ( foo \n 

S P-#) = s + . s I ) 

/e~ 0 \J — co/ j-f-l \j i / 

where it is easily seen that 

| e 2 1« 2 ?tc f j , |e s |<2ynf + 2 (*-j)| 

J- co J1 + 2 i=i+l J 1 

Finally, we want to replace in formula (6) the first term 

(2j+l)P(2j) by (2j+l). 


+ e 2 + ^3 


ri+i 


( 8 ) 

0) 


( 10 ) 

(11) 

( 12 ) 


* In this section we deal with fixed group intervals and a fixed sample size n so that we drop 
the arguments n, h and 
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The resulting error is easily estimated. We have from (5) 

(2j’ + l) (1 — P('2j)) = (% + 1) (X - )) + e 4 


where 


|<m|«S(2? + 1)% 


I 


-j i-i 


( 13 ) 


Moreover, according to the definition (4) we may write 

( CO j /'i+2;+l\ n [f i+2j+2\7i 

W+UMW) -(J m 


<*£+aj i-s\ Ji 

M 1 


where | e 5 1 < 

so that finally we have 


+ (2j+l)[.S.( 

/ c+j+nr 

= (2j+l)( ) +e„ 

CO ri l 2j h2 C-tj 

(2j+l)S« <»(2i+l) 


i+2j+l\il 
i-l-1 


J II 


(2?+l)(l-p^(2j))--e 5 + 6 0l 


with 


<^( 2 j+ 1 )« 


nr +D- 


(14) 


(15) 


The error terms e,, e 2 , e 3 , e,„ e 5 and e„ are of the form 


< 1 ? 


[*» r-3 

; CJ 

' i J -o 


or 2 (i-c) 

j -j 


i+i 


It is easy to see that the above terms tend to 0 as j-± oo. To prove this for the 
first term we write 


(t+jh 

h 




\ Pea f* co J Pen 

+ 7J; < Ji 


xf(x) dx, 


which tends to 0 as f(x) has contact of order 1 at + oo. The proof for the other 
terms is identical. For sufficiently large j we can therefore use the approximations 
given by (10) and (12) and transform equation (6) into the convenient, form 


3 = Jilim. I (2j -f-1) 
i-+ 


j+1 / ri \n j'+l / r<n\n 

- s - s 

i=-i+l\J -co/ / 


(16) 


Equation (16) gives the mean range 3 in a grouped sample in terms of powers of 
the probability integral of the parental population. For a normal distribution 
this is a particularly simple formula since such powers have already been cal¬ 
culated by L. H. C. Tippett (1925) and are conveniently tabulated in Table XXI 
of the Tables for Statisticians and Biometricians, Part II. A table of 3 can, there¬ 
fore, be easily computed by adding a few entries from Tippett’s table and 
deducting the sum from the appropriate value of 2j + 1. 

This has been done for samples of five, ten and twenty observations grouped 
in categories of breadth h (see table on p, 339). The parameter £ denotes the 
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distance of the population mean X — 0 from the nearest group end-point. For 
given h the mean range S in grouped samples is obviously a symmetrical periodic 
function of £ with period h. The table has been extended to cover rather coarse 
grouping intervals (h = 2-2cr) in order to illustrate the possible bias of range when 
estimated from frequency tables with as few as two or three categories. It is apparent 
that for small or moderate group intervals, say h^cr, the mean range is practically 
independent of h and £, so that no correction (corresponding to the well-known 
Sheppard’s correction for the sample standard deviation) is required for the 

Table of mean range in grouped samples drawn from a normal 
population having unit standard deviation 

Size of sample =n. Width of group interval= h. 

Distance of population mean to nearest group-end point = £. 


h 

£ 

n = 5 

u = 10 

» = 20 

0-2 

0-0 

2-32 593 

3-07 751 

3-73 495 

0-0 

0-0 

2-32 593 

3 07 750 

3-73 500 


0-2 

2-32 693 

3-07 751 

3-73 492 

1-0 

0-0 

2-32 532 

308 122 

3-72 917 


0-2 

2-32 674 

3-07 805 

3-73 317 


0-4 

2-32 042 

3 07 450 

3-73 962 

14 

0-0 

2-31 042 

3-06 204 

3-82 069 


0-2 

2-31 626 

3-06 787 

3-78 826 


0-4 

2' 32 938 

3-08 095 

3-71 575 


0-0 

2-33 990 

3-09 143 

3-65 796 

1-8 

0-0 

2-29 22? 

2-90 539 

3-67 974 


0-2 

2-30 022 

2-94 491 

3-69 653 


0-4 

2-32 023 

3'04 621 

3-73 085 


0-6 

2-34 276 

3-16 366 

3-76 263 


0-8 

2-35 734 

3-24 140 

3-77 838 

2-2 

0-0 

2-36 011 

2-77 080 

3-27 509 


0-2 

2-35 652 

2-81 669 

3-34 611 


0-4 

2-34 221 

2-94 307 

3-53 896 


0-6 

2-32 266 

3-11 581 

3-79 602 


0-8 

2-30 265 

3-28 173 

4-03 847 


1-0 

2-28 963 

3-38 358 

4-18 452 


range. For h = 0-2cr the mean range in the grouped sample agrees with the theo¬ 
retical ungrouped range to five places of decimals (see Table of Mean Range, 
Table XXII of Tables for Statisticians and Biometricians, Part II). For coarse 
grouping the correction becomes important but depends on £ (as well as on h ). 
For fixed h, as £ varies from — \h to 4- \h the grouped mean range oscillates about 
the true mean range as a smooth single-period function. The reason for this is 
obvious. If £ has a position such that the average positions of x n and aq both 
happen to fall within the outside halves of two group intervals, then £ w -£i will 
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on the average, be smaller than x n — xp, vice versa, if the average positions of 
x n and are in the inside halves of two group intervals, there will be a pre¬ 
dominance of samples for which E, n - is larger than x n — x v 

Moreover, as h increases, the grouped mean range becomes a less reliable 
estimate, and it can be shown that for h>cr the standard deviation of the 
random sampling 'distribution of the grouped range rapidly increases with h. 

We are thus led to consider two problems; one is the elimination of the para¬ 
meter i (or the dependence of the distribution on the position of the parental 
population mean); the other is to investigate more closely the random sampling 
distribution of the grouped range. Before dealing with these problems, however, 
we must first consider the distribution of the true range (range in the ungrouped 
sample). 


4. The probability integral or the range in random samples 


As before we denote by x v ..., x n the observations in a random sample drawn 
from a parental distribution fix) and arranged in ascending order of magnitude. 
The range in such a sample, defined as w = x n — x x , may be regarded as the limit 
of the grouped range £ (l - as h, the group interval, tends to 0, i.e. 

w = lim 
h~> o 


The probability integral of the range w, denoted by P n (W), is therefore the limit 
of P(n,h, m-1, £), given by (3), as h tends to 0. To obtain this limit we write 
equation (3) as follows: 


+ co / \n ( n+(i+m)h \n 

P{n,h,m- 1,£)= £ ( f(x)dx\ - f{x)dx\ 


-hoo 

= £ n 


(/: 


•£+(i+m)h \»—1 _ 

f(x)dx\ f(JU)h, 
et ' 


where is some mean value between £ + ih and £ 4- {i + 1) h. 

We now put m = W/h or W — mh, 

and let h tend to 0, m to oo, keeping W constant. We obtain without difficulty 


P n {W) = lim P{nJi,m—l,E,) 

7t->0 

7n~>co 


f + *> / \n~l 

= fW dx ) d £> ( 1? ) 

which is the required probability integral of the range. This integral may be 
compared with the expression for the distribution function of w which was given 
by A. T, McKay & E, S. Pearson (1933). It is easily verified that the function 
(f>(w) given by these authors is the differential of P(W). The expression for P{W) 
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is decidedly simpler than that for <f>(w) which was used by Pearson for numerical 
work on this function. However, even P(W) is of a complex character, and only 
in special cases is it possible to evaluate it analytically. For the rectangular 
distribution function (f(x) = 1, 0 < x ^ 1) this can be done easily. 


5. The probability integral of the range in samples 

FROM A NORMAL POPULATION 

Of particular interest is the case where the parental population is normal. 


In this case we have 


f{x) = *(*) = 


V( 27r ) 


-Jz 2 


(18) 


L. H. C. Tippett (1925), E. S. Pearson (1926,1932) and A. T. McKay & E. S. Pear¬ 
son (1933) have considered this problem and carried out extensive numerical work, 
The method adopted was to calculate correct values of the means and standard 
deviations of the distributions (as functions of n) and then, with the help of approxi¬ 
mate values of Ji 1 and /? 2 use as approximations to the unknown true distribution 
Pearson-type curves fitted by the method of moments. The numerical results, 
although they have been successfully tested by experimental sampling, have, 
of course, an unknown accuracy. It is therefore desirable to find a method which 
produces P n ( W) to known and sufficient accuracy. 

We have P ?l (lT) = nj 2(1)^] z(x)dxj d£ 

r-iw f+.oo 

= n\ -t-n ~h+h ( sa y)- 

J-<*> J-iW 

Writing rj = -(£+ W), - £ - 7) + W, 

we obtain PJW) = nf z(-ij- W)( f z{x)dx\ dt} + I 2 . 

J -iw \J -(v+m 1 

Using the symmetry of z(x) we may write 

P oo f fy+W \n—l 

P, t (Tf) = «.J z(x)dxj drj +I z , 

and writing £ as a variable of integration in place of tj we find 

rao j r(+W \»-l 

P n (W)~nj^[z(£+W) + z(£)]^ z(x)dxj d£, 

i>w t rs+w \«-i 

or p n (lf) = -wj ^[2(£+F)-z(£)](J^ z{ ? )dx ) ^ 

/ rt+w \»-i 

2nj z(£+ W) z(x)dxj d£. 


+ : 


(19) 



342 


The range in random samples 


Integrating the first integral and introducing u = g + W in the second integral 
we finally obtain 


PJW) 


[ r+iW \n / (*« \m-l 

= | z(x)dx) +2 n) z(u)\\ z{x)dx\ 
U-i» 7 / Jiiv \Ju-w ! 


du. 


( 20 ) 


For large values of W this is an approximate solution of the problem since the 
second term in (20) is small, so that the first term 



gives a fair approximation to P n ( W). This expression denotes the chance of 
observing samples with observations all lying between — JJF and + 1W ; all 
these samples have a range smaller than or equal to W. For large W it is these 
samples which constitute an ever-increasing proportion of the total number of 
samples with range < IF. 

The second term in (20), which is always positive, takes into account all those 
samples which are not contained in the interval —\W to + -}W. This term cannot 
be ignored if high accuracy is required and if W is small or moderate. Never¬ 
theless, the work involved in the numerical integration has been considerably 
reduced, for the range of integration is now from +}W to + co. 

The numerical integration of 


*£0 / fU \»~1 

z{u) I z(x) dx I du 
J ijr \J u-w / 


is best carried out simultaneously for values of n forming an arithmetical pro¬ 
gression. For fixed u and IF, the integrand iB then given by the terms of a geo¬ 
metrical progression with, say, 


as first term and 


z(u)(\ z(x)dx\ 
\Ju-w 1 

(f“ z(a;)da;) 

\J u-w ! 


2 


as common ratio. Such a geometrical progression can be produced automatically 
by certain modern calculating machines. This forms the basic idea of the actual 
computation of the probability integral which is described in detail in another 
paper (pp. 309-10 above). 


6. The range in' randomly grouped samples 

The results of sections (2) and (3) on the effect of grouping on the distribution 
of range depend on the parameter £, which denotes the origin of the equi¬ 
distant set of group end-points 

£+ih i = 0,1,2,.... 

-h-2, ..., 

given by equation (1). In practice, however, all we know is the category breadth, h. 
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We then select the actual group end-points £ + ih from considerations which 
are, by necessity, independent of the position of the mean X of the parental 
population f(x), because this position will generally be unknown. One of our 
group end-points, however, is bound to fall into the interval 

X-\h to X + lh, 

wherever this interval may lie. Now, since the origin £ in our system of group 
end-points is wholly arbitrary we may assume that for given h we have the 
inequality X-±h<teX + $h. 

The fact that our group end-points (and therefore their origin £) are chosen in¬ 
dependently of X has now to be expressed in mathematical terms. This is done 
by assuming that we are dealing with a population of values of £ (being the origins 
of corresponding systems of group end-points) which are rectangularly distributed 
in the interval X—\h^^X + \h. This condition is exactly fulfilled where 
grouping has been introduced through rounding off of data (example (a) on 
p. 335), and it is often an appropriate assumption in the other examples, as in 
many other cases of grouping which occur in practice.* 

In order to derive the distribution of range in samples randomly grouped in 
the above sense we have to return to section (2). In this section we derived the 
probability P(n,h,m — 1, £), giving the chance that the difference between the 
centre points £ n and £ x of the highest and lowest category covered by a sample 
of n items is at most where h is the constant category breadth and 

group end-points are given by (1). The frequency distribution of £ n —£ t which 
we may denote by h, m, £) is then given by 

4>{n, h, m, £) = P(n, h, m, £) -P(n,h,m- 1, £), 

and represents the chance that £ w — £ x is exactly mh, given a particular value of £. 
The corresponding frequency distribution for random grouping may be denoted 
by h, m), To derive it we may apply Bayes’s Theorem and obtain 

1 rx+ih 

cf>(n, h, m) = T <f>(n, h, m, £) 

nj X- ih 

The resulting cumulative probability may therefore be defined by 

m 1 /*X+i h vi 

P(n, h,m) = S <K n > ll >5) = t S 

which yields the corresponding relation for the cumulative probabilities 

1 [X+ih 

Pin , h, m) = 7 P(n, h, m, £) 

njx-ih 

* In certain cases, when grouping is very coarse, it may be advantageous to use an estimate 
x of the population mean X (either dependent or independent of the sample whose range is con¬ 
sidered). The increase in information is akin to that given by an ancillary statistio in estimation 
theory. However, from the results in § 3 it would appear that little information is gained where 
the grouping interval is small or moderate. 
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Substituting, now, the expression (3) for P{n, h, m, £) we obtain 

1 +00 jVY+iA j l r£+(i+m \-l)h 


P(n ! h,m) = j £ 

CO J X-lh WJl 

-y: 


£+«i 

M-(m+l)A 


f(x)dx\ 


\n / r£+(iH m+l)/t \n\ 

f(x)dx\ 

J \J £+(£+l)ft / J 


\ n / rji+mh \ Ml 

f(x) dxj jd£. 




(21) 


This formula may be reduced to a simpler form in which its relation to the pro¬ 
bability integral of the true range (17) becomes apparent. We introduce the 
second integral 

rw 

W) = j o 

I'W /* + “ / P£+ w \ n—l 

= Jo wj /(£)y^ /(*) dx j dgdw. (22) 

The first integration is with regard to w and the integrand is an integral with 
regard to £. If, now, in this latter integral y\ — £ + w is used as variable of integra¬ 
tion in place of £ we have 


IF 


F n {W) = n f{V-w) 


H-co 


i(T f(x)dxY~ l dridw. 

\J t)-V) J 

We now note that the integrand may be written as a differential with regard to w. 
Thus, interchanging the order of integration we obtain 

z’+oo I'W d I fy 

r + oo / rtj pi 

-J |J w f( x ) dx J (23) 

thus eliminating integration with regard to w. The surviving variable of integra¬ 
tion i] may now be replaced by £ = t? — W so that we reach the final result 


-L (J £ > 


f(x)dx\ d£. 


(24) 


We now observe that this integral is identical with the one occurring in the 
expression (21) for P(n, h, m), and we note the relation 


P(n, h,m) = ^ {F n (m +1 h) - F n (mh)} 


1 I*(m+ l)h 

= t P n (w)dw. (26) 

n 'J mh 

This simple formula makes it possible to obtain the effect of random grouping on 
the probability integral of range P n (W). In particular, for normal samples for 
which P n (TT) has been tabulated at the fine interval of Aw = 0-05, the second 
integral F n (W) is easily obtained numerically by summation of the tabular entries 
in the table on pp. 302-7 above, 
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For h-> 0, m-*oo, mh -> W, we note that, as expected, P(n,h,m)ZZP n {W)] 
that is, as grouping becomes finer and finer the probability integral of the grouped 
range tends to that of the true range. What is not quite obvious, however, is the 
identity of mean grouped range and mean true range, no matter how large the 
breadth, h, of the randomly placed groups. This point we shall now examine. 

If we denote by w{n, h) the mean range in samples of n items classified in 
groups of breadth h randomly placed, we have by definition 

00 oo 

w(n,h)= £ <j>(n,h,m)mh = £ {P(n,h,m)~P(n,h,m-l)}mh. (26) 

/««() m-=0 

We now introduce central differences of the function F n (lF) and use the notation 

A' m+i = F n {m + Th)-F n {mh), A" m = F n (rn + 1 h) - 2 F n {mh) + F n {m -1 h), (27) 
so that we obtain from (25), (26) and (27) 


We may now write 


w(n, h) - £ 

wi — X 

M 

iv(n,h ) = lim £ A" m m 

A/—m=-1 


= lim [ MA' M+i - 

M —> cc l, 7H—0 / 

= lim {MA' M+i -F n (Mh)}. (28) 

jtr-».oo 


On the other hand, we have for the mean true range (w n say) 

w n = wf n (w)dw, 
d 

where w) — -j--P n (w) is the distribution function of the true range. We, 
therefore, have 


r Mh 

w n = lim wf n (w) dw 

M~*xJ 0 


= lim \MhP % 

M-+00 


% Mh 

\(Mh) - P n {w)dw . 
Jo 


iv(n,h)-w n = ]im Mh It A' M+i -P n (Mh) 

M~+ CO 


On taking the difference of (28) and (29) we see that 


Now 

where 
so that 


1 _ 1 f (Jkf+1)ft 

h AM+ *~h} Mh 


P n (w)dw=P n (Mh)+f n (w*)h*, 


Mh<w*<(M+l)h and h*^h, 
w(n, h)~w n | < lim f n {w*)Mh *. 

M—> oo 


(29) 
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From this relation it is obvious that w(n, h) = w n , since f n {w) has contact of at 
least second order as w->oo.* 

We have proved, therefore, that if grouping is random the expectation! of 
the mean range w(n, h) (mean grouped range) is identical with the true mean range 
w n so that no bias is introduced through random grouping, no matter how large 
the grouping interval h. However, if we wish to use - £ x as an estimate of w n 
this estimate, although unbiased in the sense defined, becomes less and less 
reliable as h increases. This is borne out by its random sampling distribution or 
its probability integral P(n, h, m) given by (25). For normal samples it is an easy 
matter to tabulate P(n,h,m) from the table of P n {W) (pp. 302-7) and thus to 
follow up the numerical increase of its standard deviation as h increases. How¬ 
ever, to cover the case of a general parental distribution f(x), we shall derive an 
analytical formula for the variance of f B — from which approximate numerical 
results are easily obtained. 

In order to obtain this formula we consider the second moment of £ n — 
which we may denote by /t 2 (n, h). We have by definition and from equations (25), 
(26) and (27) 

M 1 

/j, 2 (n,h) = lim X 

iU —n 
M 

- lim £ hm{m- s r\)A'! m —\hw n . (30) 

M~> °om=0 

Now we may write 

M M M 

2 m(m + £)d;= £ *}- S 

«i“0 m=0 m*=Q 

M -1 M-l 

= M(M + 1)2 {m + l)A' m+i — M.F n (Mh)+ 2 F n (mh) 

m=U to=o 

M-l 

= M(M+l)A' M+ t-(2M-l)F n (Mh) + 2 £ F n {mh). (31) 

0 

Using equation (31) we obtain for the second moment (30) 

H 2 {n, h) = Jim^ {[Mhf 11 +~j - 2MhF n (Mh) 11 ) + 2 h^FJmh)-\w n j. 

(32) 

This formula enables us to compare /i 2 (n, h) with //. 2 (w), the second moment of 
the distribution of the true range. We have by definition 

rMh 

M n ) = 6m f n (w)w 2 dw, 

* It can be proved that the order of contact of f n {w) is the satno as that of the parental dis¬ 
tribution f(x). 

t If repeated samples were drawn from the same population and the same grouping system 
used in each case, the mean grouped range would be biased by an unknown amount. But in repeated 
experience with different populations the expectation of this bias is zero. 
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w liich we transform by two partial integrations into the equation 

//■ 2 («) = lim l{Mh) 2 P„(Mli)~2(Mh)F„(Mh) + 2 [^F^dw 
1 Jo 

Taking the difference of (32) ancl (33) we obtain 


/i 2 (n, h) - /t^n) 


- Km /***-*&<**>-*» 


itr~>ce ( 2 


2 


"/ 


[ -«-i r.wi 

+ lim 2/i S F„{mh) + hF n (Mh)-2 F n (w)dw 

a/-*-® l )»=o Jo 


(33) 


(34) 


Since the expected mean value of the grouped range is the same as for the 
true range, the difference in second moments about zero equals the difference in 
variances. The first term on the right-hand side of equation (34) is obviously 0 
(see equation (29)), whilst the second term is best evaluated with the help 
of Gregory’s formula for numerical integration (see e.g. L, J, Comrie, 1936, 
p. 809). Using this formula we can express the difference between the integral 
and the finite sum in equation (34) in terms of the differences of the integrand 
F n (w) at the two ends of the range of integration. We obtain 


l. 2/ h 


/t t {n,h)-ji. i (n) = lim .{W, 

I, 


M-y 


•^ + f£{^-i + 4» + J 


lh(h-A' i) + ~hA1 + ..., 


(35) 


provided the Gregory expansion is convergent.* A\, A{, A'f, ... are advancing 
differences of the function F n (w) at w = 0. 

Equation (35) yields the desired formula for the second moment of £ m -£i. 
For most parental distributions the resulting probability integral of the range 
will be practically 0 for a certain range in the neighbourhood of the origin (see for 
instance the behaviour of P n ( IT) from a normal parent given in the table on 
pp. 302-7). For such parents and for moderate h we have 

fs... ^0, 

so that /f. a (», h)—/i 2 (n)^%h 2 , (36) 

For small or moderate values of h, therefore, the increase in variance of the 
grouped range is given approximately (and for most parents to a high degree of 
accuracy) by %h 2 . This increase is double the amount given by the well-known 
Sheppards correction of -^h 2 . Indeed, had we grouped a sample of true ranges w 
in fixed categories of breadth 7i, the resulting second moment of the grouped 
distribution of w would have an expectation which is -^h 2 in excess of the second 
moment of the true range. With random grouping of the original sample (as it 

* This condition is as a rule fulfilled for values of h which do not exceed the standard deviation 
of the parental distribution f(x). 
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has been defined above in accordance with common practice of grouping) an 
additional uncertainty is introduced by using a new, randomly selected, set of 
group intervals each time a new grouped range is determined, This additional 
uncertainty has been proved roughly to double the excess of the variance and the 
result is an increase of jh 2 over the variance of the true range. 

I wish to acknowledge with gratitude the helpful suggestions and criticisms 
made by Drs J. Wishart and J. 0. Irwin and by Professor E. S, Pearson at 
various stages of this investigation. 
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(i) The Second Yearbook of Research, and Statistical Methodology Books 
and Reviews. Edited by Oscar Krisen Btjros. The Gryphon Press, 
Highland Park, New Jersey, 1941. $5. 

This is a second and much onlarged issue of a volume published in 1938. It contains 
nearly seventeen hundred review excerpts on 346 statistical and allied books (in the 
English language only), extracted from 283 different journals. The editor has attempted 
with considerable success to cover the whole field of statistical and probability theory, as 
well as their applications in every possible direction. He has also included reviews of a 
number of books on the general history of science, on scientific method and on the social 
relations of science on the ground that they are—or should be—of general interest to 
scientific workers in every special field. Included in this category are books such as 
J. D. Bernal’s The Social Function of Science, J. G. Crowther’s The Social Relations of 
Science and J. B. S. Haldane’s The Marxist Philosophy and the Sciences. 

This large volume of several hundred pages has been admirably produced and arranged. 
It is intended to publish a fresh volume every two years containing reviews that have 
appeared in the interval. The Preface sets out a variety of reasons which, in the Editor’s 
opinion, justify the present venture and oven its enlargement in the future if sufficient 
support is forthcoming; at the same time frank expressions of opinions are asked for from 
readers and reviewers. 

The objectives of the Yearbook as set out may be classed under four general heads: 

(1) To help students, teachers and librarians to select text-books with greater dis¬ 
crimination and to point out to them the weak and strong points of particular books. 

(2) To indicate the width of the subject of statistics and the many fields in which it 
is applied. 

(3) To make students and teachers aware of the inadequacy of muoh that is now pre¬ 
sented in text books and classes; to discourage the publication of books written by persons 
ignorant of the latest developments in their subject. 

(4) To improve the quality of reviews by enoouraging editors and reviewers alike to 
take their responsibilities more seriously. 

With the last three objectives it is hardly possible to quarrel, and it is likely that the 
wide circulation of this volume would provide one of the most direct methods of attaining 
these ends. The first objective is, however, presumably the most important, and there are 
bound to be differences of opinion on the probable success of the book in this direction. In 
the ordinary event the teacher will no doubt be made aware of new books in the field with 
which he is conoerned by reading the notices in one or two journals specially devoted to his 
subject. Having obtained a suggestion of a likely book he must surely get hold of it and 
determine by reading it himself whether it is suitable and abreast of the latest develop¬ 
ments. If he is not competent to do this, but must base his decision on the advice of 6-10 
reviewers, it seems doubtful whether he should be teaching the subject at all. 

After reading through the reviews on some dozen books contained in the present 
Yearbook, I am inclined to the following conclusions. Regarding books of outstanding but 
perhaps rather controversial character, as those of Harold Jeffreys and Richard von Mises, 
the reader will certainly gain a useful impression from the collected reviews. This is partly ■ 
because in such eases the standing of the reviewers is high and their reviews interesting and 
fairly written, even if critical. But in the case of the more elementary text book* thb ■ 
position is rather different. Quite often the opinions expressed are diametrically opposite. 

In cases where I knew nothing of the book or its author I found myself inevitably forced 
to form an opinion from my own personal knowledge of the experience, the special interests 
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and even the character of the reviewer. Such inside information will generally not be 
possessed by the-College instructor and certainly not by the student. We cannot, I think, 
escape the conclusion that the teacher who has to select a text book for his students must 
be competent to decide on its merits himself and, if he is not, he may be only confused by 
the varied opinions contained in the Yearbook. 

Three future directions'in which the volume might be enlarged are contemplated: 

(1) The inclusion of reviews of foreign language (i.e, not English) books, 

(2) The addition of a section devoted to non-critical abstracts of periodical literature 
on research and statistical methods. 

(3) The publication of original criticisms by one or more persons (according to the 
importance and controversial nature) of articles and papers in the periodical literature. 

The first addition is clearly desirable; the .publication of translations of reviews in 
foreign journals of our own American and British books would probably be useful too; it 
would help us to see ourselves as others see us. With regard to the second and third pro¬ 
posals, the great difficulty is of course to secure the services of sufficiently competent 
abstractors or critics for so large an undertaking. If, quoting the Editor, the statistical 
student and teacher are to be kept ‘abreast of modem developments in statistical theory’; 
to be warned ‘to ignore much of the literature which either presents nothing new or presents 
inefficient or incorrect methods of statistical analysis’; to be told what are ‘sloppy, value¬ 
less, and erroneous articles’ and what are ‘well-written, significant contributions’, it is 
clear that a very great responsibility will lie on the Editor of the Yearbook and his col¬ 
laborators. As Prof, Buros indicates, the organizing and editing of such a comprehensive 
service would need the support of a foundation interested in fostering the advancement of 
research. Indeed, to avoid duplication the organization must be built up on an inter¬ 
national basis, possibly in collaboration with such bodies as the American Statistical Asso¬ 
ciation and the Royal Statistical Society between whose representatives some discussion 
of a similar project took place a few years ago. ESP 
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